METHOD FOR IDENTIFYING METASTATIC SEQUENCES

Rights in the Invention

This invention was made in part with United States
Government support under grant number CA350129, awarded by the
National Cancer Institute, National Institutes of Health, and the United
States Government has certain rights in the invention.
Background 

1. Field of the Invention

The present invention relates to methods for the identification 
and isolation of metastatic sequences, to diagnostic probes and kits which 
contain metastatic sequences and to therapeutic treatments for neoplastic 
disorders based on metastatic sequences. 

2. Description of the Background

The development of higher organisms is characterized by an
exquisite pattern of temporally and spatially regulated cell division.
Disruptions in the normal physiology of cell division are almost invariably
detrimental. One such type of disruption is cancer, a disease that can arise
from a series of genetic events.

Cancer cells are defined by two heritable properties:
uncontrolled growth and uncontrolled invasion of normal tissue. A
cancerous cell can divide in defiance of the normal growth constraints in a
cell, leading to a localized growth or tumor. In addition, some cancer cells
also gain the ability to migrate away from their initial site and invade other
healthy tissues in a patient. It is the combination of these two features that
makes a cancer cell especially dangerous.

An isolated abnormal cell population that grows uncontrollably
will give rise to a tumor or neoplasm. As long as the neoplasm remains in
a single location, it is said to be benign, and a complete cure may be
expected by removing the mass surgically. A tumor or neoplasm is counted
as a cancer if it is malignant, that is, if its cells have the ability to invade
surrounding tissue. True malignancy begins when the cells cross the basal
lamina and begin to invade the underlying connective tissue. Malignancy
occurs when the cells gain the ability to detach from the main tumor mass,
enter the bloodstream or lymphatic vessels, and form secondary tumors or
metastases at other sites in the body. The more widely a tumor metastasizes,
the harder it is to eradicate and treat.

As determined from epidemiological and clinical studies,
most cancers develop in slow stages from mildly benign into malignant
neoplasms. Malignant cancer usually begins as a benign localized cell
population with abnormal growth characteristics, called a dysplasia. The
abnormal cells acquire abnormal growth characteristics resulting in a
neoplasia characterized as a cell population of localized growth and
swelling. If untreated, the neoplasia in situ may progress into a malignant
neoplasia. Several years, or tens of years, may elapse from the first sign of
dysplasia to the onset of full-blown malignant cancer. This characteristic
process is observed in a number of cancers. Prostate cancer provides one of
the clearer examples of the progression of normal tissue to benign
neoplasm to malignant neoplasm.

The walnut-sized prostate is an encapsulated organ of the
mammalian male urogenital system. Located at the base of the bladder, the
prostate is partitioned into zones referred to as the central, peripheral and
transitional zones, all of which surround the urethra. Histologically, the
prostate is a highly microvascularized gland comprising fairly large
glandular spaces lined with epithelium which, along with the seminal
vesicles, supply the majority of fluid to the male ejaculate. As an
endocrine-dependent organ, the prostate responds to both the major male
hormone, testosterone, and the major female hormones, estrogen and
progesterone. Testicular androgen is considered important for prostate
growth and development because, in both humans and other animals,
castration leads to prostate atrophy and, in most cases, an absence of any
incidence of prostatic carcinoma.

The major neoplastic disorders of the prostate are benign
enlargement of the prostate, also called benign prostatic hyperplasia (BPH),
and prostatic carcinoma, a type of neoplasia. BPH is very common in men
over the age of 50. It is characterized by the presence of a number of large
distinct nodules in the periurethral area of the prostate. Although benign and
not malignant, these nodules can produce obstruction of the urethra causing
nocturia, hesitancy to void, and difficulty in starting and stopping a urine
stream upon voiding the bladder. Left untreated, a percentage of these
prostatic hyperplasias and neoplasias may develop into malignant prostate
carcinoma.

In its more aggressive form, transformed prostatic tissues
escape from the prostate capsule and metastasize, invading locally and
through the bloodstream and lymphatic system. Metastasis, defined as
tumor implants which are discontinuous with the primary tumor, can occur
through direct seeding, lymphatic spread and hematogenous spread. All
three routes have been found to occur with prostatic carcinoma. Local
invasions typically involve the seminal vesicles, the base of the urinary
bladder, and the urethra. Direct seeding occurs when a malignant neoplasm
penetrates a natural open field such as the peritoneal, pleural or pericardial
cavities. Cells seed along the surfaces of various organs and tissues within
the cavity or can simply fill the cavity spaces. Hematogenous spread is
typical of sarcomas and carcinomas. Hematogenous spread of prostatic
carcinoma occurs primarily to the bones, but can include massive visceral
invasion as well. It has been estimated that about 60% of newly diagnosed
prostate cancer patients will have metastases at the time of initial diagnosis.

Surgery or radiotherapy is the treatment of choice for early
prostatic neoplasia. Surgery involves complete removal of the entire prostate
(radical prostatectomy), and often removal of the surrounding lymph nodes
(lymphadenectomy). Radiotherapy, occasionally used as adjuvant therapy,
may be either external or interstitial using 125I. Endocrine therapy is the
treatment of choice for more advanced forms. The aim of this therapy is to
deprive the prostate cells, and presumably the transformed prostate cells as
well, of testosterone. This is accomplished by orchiectomy (castration) or
administration of estrogens or synthetic hormones which are agonists of
luteinizing hormone-releasing hormone. These cellular messengers directly
inhibit testicular androgen synthesis and suppress luteinizing hormone
secretion, which in turn leads to reduced testosterone secretion by the testes.
Despite the advances made in achieving a pharmacologic orchiectomy, the
survival rates for those with late stage carcinomas are rather bleak.



Summary of the Invention

The present invention overcomes the problems and 
disadvantages associated with current strategies and designs and provides 
new methods for the identification of sequences related to metastasis. 

One embodiment of the invention is directed to methods for
the identification of a metastatic sequence. One or more oncogenic
sequences are transfected into a cell to form a transfected cell. The
transfected cell is introduced into a primary site of a host animal to establish
a colony which is incubated in the animal for a period of time sufficient to
develop both a primary tumor and a metastasis. Expressed sequences
are harvested from the primary tumor and the metastasis. Harvested
sequences are compared to each other and to non-metastatic cells to identify
sequences related to metastasis. Dominant metastatic genes are genes whose
expression leads to metastasis. Such genes are typically expressed at high
levels in metastatic cells and not significantly expressed in normal or
nonmetastatic cells. Recessive metastatic genes, genes whose expression
prevents metastasis, may be selectively expressed in normal and
nonmetastatic cells and absent in metastatic cells. Dominant and recessive
metastatic genes may act directly or act pleiotropically by enhancing or
inhibiting the expression or function of other dominant and recessive
metastatic genes.
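
By way of illustration only, the distinction between dominant and
recessive metastatic genes can be sketched as a simple rule applied to
measured expression levels. The function name, the fold-change threshold,
the background floor and the sample values below are all hypothetical
assumptions made for this sketch; the specification itself does not
prescribe any numerical cutoff.

```python
def classify_sequence(metastatic_level, nonmetastatic_level, fold=4.0, floor=1.0):
    """Label a sequence from its expression in metastatic vs. nonmetastatic cells.

    fold  -- hypothetical minimum ratio treated as "differentially expressed"
    floor -- hypothetical background level, avoids division by zero
    """
    m = max(metastatic_level, floor)
    n = max(nonmetastatic_level, floor)
    if m / n >= fold:
        # High in metastatic cells, low elsewhere: dominant candidate.
        return "dominant candidate"
    if n / m >= fold:
        # Present in normal/nonmetastatic cells, absent in metastases.
        return "recessive candidate"
    return "unclassified"


# Hypothetical expression levels: (metastatic, nonmetastatic), arbitrary units.
levels = {"seq_A": (120.0, 5.0), "seq_B": (2.0, 80.0), "seq_C": (40.0, 35.0)}
calls = {name: classify_sequence(m, n) for name, (m, n) in levels.items()}
```

A candidate flagged in this way would still need to be confirmed
experimentally, for example by RNA blot, before being treated as a
metastatic sequence.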

Another embodiment of the invention is directed to methods
for identifying metastatic sequences. A mammalian cell is treated with a
metastatic agent and the treated cell is implanted into a primary site of a host
mammal. The host animal is maintained for a period of time sufficient for
the cells to proliferate and to develop a metastasis at a secondary site.
Expressed sequences from cells of the primary site and cells of the secondary
site are reverse transcribed into cDNA and amplified by differential display
polymerase chain reaction to identify differentially expressed sequences.

Another embodiment of the invention is directed to sequences
isolated by the methods of the invention. Sequences may be in the form of
DNA, RNA or PNA. The nucleic acid may be single-stranded or
double-stranded. Single-stranded nucleic acid may be in the form of a sense
strand or an antisense strand. In addition, the sequence may be part of a
homologous recombination vector designed to recombine with another
metastatic sequence.

Another embodiment of the invention is directed to a method
for treating a neoplastic disorder comprising administering a
pharmaceutically effective amount of a metastatic nucleic acid to a patient.
The nucleic acid may be single-stranded in the sense or the antisense
direction. Alternatively, the nucleic acid may be packaged in a viral vector
such as, for example, a retroviral, a vaccinia or an adenoviral vector.
Administration may be performed by injection, pulmonary absorption,
topical application or delayed release of the nucleic acid along with a
pharmaceutically acceptable carrier such as water, alcohols, salts, oils, fatty
acids, saccharides, polysaccharides and combinations thereof.

Another embodiment of the invention is directed to a kit for
detecting the presence or absence of a metastatic sequence.

Other objects and advantages of the invention are set forth in
part in the description which follows, and in part, will be obvious from this
description, or may be learned from the practice of the invention.

Description of the Drawings

Figure 1   Schematic showing two paths in the multistep progression to
           cancer.
Figure 2   Staining of primary tumor (A) and metastatic deposit (B) from
           the lung of the same animal.
Figure 3   Staining of normal human prostate (A), moderately
           differentiated human prostate tumor (B and C), and poorly
           differentiated prostate tumor (D).
Figure 4   Schematic of method for isolating a metastatic gene from a
           gene ablated mouse strain.
Figure 5   Schematic showing method to establish a tumor and a
           metastatic transplant from fetal tissue (A) and from cell lines
           and tumors (B).
Figure 6   Isolation and characterization of nmb gene expression by
           DD-PCR and RNA blot in primary and metastatic cells.
Figure 7   Differential expression of multiple genes is determined by
           DD-PCR and RNA blot of primary and metastatic cells.
Figure 8   Caveolin identified as a differentially expressed gene by
           DD-PCR.
Figure 9   Differential expression of genes isolated by DD-PCR
           confirmed by RNA blots.
Figure 10  RNA blot analysis of total tumor mRNA using clone 29
           GAPDH probes.
Figure 11  RNA blot of three independent MPR metastatic tumors and 5
           MPR non-metastatic tumors.
Figure 12  Nucleotide sequences of metastatic nucleic acids.
Figure 13  Characterization of metastatic sequences isolated.
Figure 14  Immunohistological staining of primary and metastatic human
           prostate tumors using anti-caveolin antibodies.

Description of the Invention 
As embodied and broadly described herein, the present
invention is directed to methods for identifying metastatic sequences, to the
metastatic sequences identified, to methods for the detection, diagnosis and
treatment of disorders related to metastasis, and to diagnostic kits which
comprise these sequences.

The ability of cancers to metastasize makes tumors difficult to
eradicate by any means. Malignant cancer involves a multistage progression
from, for example, normal tissue through hyperplasia, early adenoma, early
carcinoma and finally to a metastatic tumor (Figure 1). Cells of a typical
tumor loosen their adhesion to their original cellular neighbors and cross the
basal lamina and endothelial lining to enter the body's circulation. Once in
circulation, the metastatic cell exits from the circulation to disseminate
throughout the body and proliferate in a new environment.

Like the initial oncogenic event, the ability of a cell to
metastasize requires additional mutational or epigenetic changes. An
understanding of the molecular mechanisms of metastasis allows for the
design of treatments to inhibit metastasis. Knowledge of stage specific gene
expression for neoplastic disorders allows for early detection and typing of
tumors. With early detection and typing, proper treatment may be
administered to a patient with the neoplastic disorder earlier, which will lead
to a higher probability of a complete cure.

For human prostate tumors, the study of stage specific tumors
is difficult, if not impossible, as cell lines are extremely difficult to grow and
it is rare that tissue becomes available from the primary tumor as well as
metastatic disease from the same patient. This problem is exacerbated
because of the infrequent biopsy of metastatic deposits in concordance with
isolation of material from the primary tumor. Furthermore, the growth of
cell lines from malignant prostates has proved to be problematic over the last
few decades. This is evidenced by the lack of cell lines from prostate cancer
obtained under any conditions.

One embodiment of the invention is directed to a method for
identifying a metastatic sequence. A mammalian cell is transformed into a
pre-neoplastic or neoplastic state or phenotype by transfection with one or
more oncogenic sequences. Alternatively, or in addition to transfection, the
mammalian cell may be treated with an agent or subjected to a condition that
potentiates the metastatic character of the cell or predisposes the cell to
metastasis. The transfected or treated cell is implanted into a host animal at
a primary site and grown for a period of time sufficient to develop a
metastasis at a secondary site. Expressed sequences from cells of the
primary site and cells at the secondary site are amplified by differential
display polymerase chain reactions. PCR products from these reactions are
compared and the metastatic sequence identified by alteration in the levels
or patterns of the resulting products.
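
As a minimal sketch of the comparison step, and not a description of any
particular laboratory protocol, the PCR products from the primary site and
the secondary site can be represented as sets of band sizes per primer
combination and compared for bands present in only one sample. The primer
labels and band sizes (in base pairs) below are hypothetical values chosen
for illustration.

```python
def differential_bands(primary, metastasis):
    """Return bands seen in only one of the two samples, per primer combination.

    primary, metastasis -- dicts mapping a primer-combination label to an
    iterable of band sizes (bp) read from the differential display gel.
    """
    result = {}
    for primer in sorted(set(primary) | set(metastasis)):
        p = set(primary.get(primer, ()))
        m = set(metastasis.get(primer, ()))
        gained = sorted(m - p)  # bands appearing only at the secondary site
        lost = sorted(p - m)    # bands appearing only at the primary site
        if gained or lost:
            result[primer] = {"metastasis_only": gained, "primary_only": lost}
    return result


# Hypothetical gel readings; primer labels are illustrative only.
primary = {"T12MG/AP-1": [210, 340, 515], "T12MC/AP-2": [180, 420]}
metastasis = {"T12MG/AP-1": [210, 515, 620], "T12MC/AP-2": [180, 420]}
diff = differential_bands(primary, metastasis)
```

This sketch captures only changes in band pattern (presence or absence);
changes in band intensity would require quantitative readings for each
band.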

Mammalian cells from a wide variety of tissue types and
species are suitable for transfection or treatment, including surgically
obtained or primary or immortalized cells and cell lines. Cells may be from
humans or other primates, mice, rats, sheep, cows, rabbits, horses, pigs or
guinea pigs or from transgenic or xenogeneic host mammals. Cells may be
obtained from adult, juvenile or fetal tissue, and used directly from the
mammal, from cryogenically preserved samples, or after culturing in vitro or
in vivo for a period of time. In vitro culturing typically involves tissue
culture conditions (e.g. 37°C; 5% CO2) while in vivo culturing may involve
successive passage of cells through host animals such as, for example, mice
or rabbits. Cells passed in vivo may be obtained from sites proximal or distal
to the site of implantation. The tissue type from which the cells are derived
or obtained may be any tissue which is susceptible to transfection or other
treatment including, for example, urogenital tissues, epithelial cells, hepatic
cells, fibroblasts, lymphatic tissues, hematopoietic cells, cells of the immune
system, cells of the gastrointestinal system and cells of the nervous system.

WO 97/18454    PCT/US96/18567

Cell types useful for the identification of metastatic sequences
related to prostate cancer include cells and cell lines of the fetal prostate
lineage from normal or transgenic animals, and cells from normal or
reconstituted prostate tissue. One method of generating reconstituted
prostate cells is to isolate fetal prostate tissue and microdissect the fetal
prostate epithelium away from fetal mesenchyme. Fetal prostate epithelium
may be genetically manipulated before reassociation with fetal mesenchyme
(Figure 5A). Genetic manipulation involves treatment or transfection with
a metastatic agent or a nucleic acid sequence that affects the neoplastic or
metastatic potential of the cell. Reassociation of fetal epithelium and
mesenchyme is performed by implanting epithelium tissue within a pocket
of mesenchyme tissue. After manipulation, cells are reimplanted into a
mammalian host in a similar manner as other cells, such as reimplantation
into or under the renal capsule.

Mammalian cells may be transfected by a variety of
techniques, all of which are well-known to those of ordinary skill. Direct
methods involve the introduction of genetic material into the nucleus of a
cell by injection. These techniques include high velocity projectile injection,
microinjection, and electroporation. Indirect methods involve the active
or passive uptake of the genetic information by the cell. Indirect techniques
include transduction with recombinant vectors, and chemical or physical
treatments such as calcium phosphate uptake, lipofection or dextran sulfate
transfection. Chemical techniques rely on chemical carriers to introduce
nucleic acids into a cell. These methods, for example, utilize unilamellar
phospholipid vesicles (e.g. liposomes) loaded with DNA (or RNA). The
approach relies on the fusion of the DNA containing vesicles with the
plasma membrane of the recipient cells. After entry, the DNA traverses the
cytoplasm and enters the nucleus. Another lipofection technique uses a
synthetic cationic lipid such as
N-[1-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium chloride (DOTMA).
DOTMA spontaneously associates with nucleic acids and forms unilamellar
vesicles upon sonication. Genetic material is incorporated into these
vesicles and subsequently transfected into the cell. Calcium phosphate
co-precipitation involves mixing of purified nucleic acid with buffers
containing phosphate and calcium chloride, which results in the formation of
a fine precipitate. Presentation of this precipitate to cells results in
incorporation of the nucleic acid into the cellular genome. Other chemicals,
such as DEAE dextran or polybrene, when present in media with nucleic
acids, can also cause the transfection of mammalian cells.

Physical methods of transfection rely on electric fields, needles
and particles to enable nucleic acids to traverse the cellular membrane.
Electric field mediated DNA transfection, commonly called electroporation,
is based on the principle that membranes, when subjected to an electric field,
undergo a reversible breakdown resulting in pores large enough to permit the
passage of nucleic acids. In micro-projectile mediated gene transfer,
micro-projectiles of subcellular dimensions are coated with nucleic acid and
propelled at high velocity into a cell using a particle gun. The nucleic acid
is introduced into the nucleus directly when the particles impinge upon the
nucleus. In microinjection, nucleic acid is injected directly into the nucleus
of a cell with a needle. Lasers have also been used to introduce minute holes
in the cellular membrane to allow introduction of nucleic acids. All these
methods may be used for transfection and the selection of the method will
depend on the cell type, the desired transfection efficiency and the
equipment available.

The efficiency of transfection may be monitored and enhanced
by the co-transfection of a selectable marker. If a marker is co-transfected
with a genetic construct, positively transformed cells may be separated from
nontransformed cells by chemical selection. The efficiency of transfection
will be increased in most cases because the chemicals will selectively kill
non-transfected cells. The number of transfected cells may also be
monitored by analyzing the degree of chemical resistance of the transfected
cells. Markers commonly used for selection purposes include, for example,
nucleic acids encoding dihydrofolate reductase, metallothionein, CAD,
adenosine deaminase, adenylate deaminase, UMP synthetase, IMP
5'-dehydrogenase, xanthine-guanine phosphoribosyltransferase, mutant
thymidine kinase, mutant HGPRTase, thymidylate synthetase,
P-glycoprotein 170, ribonucleotide reductase, glutamine synthetase,
asparagine synthetase, argininosuccinate synthetase, ornithine decarboxylase,
HMG-CoA reductase, N-acetylglucosaminyl transferase, threonyl-tRNA
synthetase, sodium or potassium dependent ATPase or derivatives or mutants
of these nucleic acids. Markers may be used individually or in combination.
Chemicals useful for selection include methotrexate, cadmium, PALA,
Xyl-A, adenosine, 2'-deoxycoformycin, adenine, azaserine, coformycin,
6-azauridine, pyrazofurin, mycophenolic acid, limiting xanthine,
hypoxanthine, aminopterin, thymidine, 5-fluorodeoxyuridine, adriamycin,
vincristine, colchicine, actinomycin D, puromycin, cytochalasin B, emetine,
maytansine, Baker's antifolate, aphidicolin, methionine sulfoximine,
β-aspartyl hydroxamate, albizziin, canavanine, α-difluoromethylornithine,
compactin, tunicamycin, borrelidin, ouabain, and derivatives and analogs
and combinations of these chemicals. Some chemicals, such as
methotrexate, may be used individually while other chemicals, such as HAT
(hypoxanthine, aminopterin and thymidine), need to be used in combination
to be effective.

The oncogene transfection efficiency, the fraction of live cells
transfected by an oncogene, may be indirectly enhanced by chemical
selection for a co-transfected marker. An oncogene is a sequence which can
predispose or induce the cell into a pre-neoplastic or neoplastic condition
or otherwise enhance the metastatic potential of the cell. Sequences with
these properties are referred to as oncogenes and include abl, ahi, akt, bcl,
crk, dsi, erb, ets, evi, fes/fps, fim, fis, fgr, flv, fms, fos, gin, gli, int, jun, kit,
mas, lck, met, mil/raf, mis, mlv, mos, myb, myc, neu, onc, pim, raf, ras, rel,
ros, seq, sis, ski, spi, src, tel, thy, trk, and yes. Some oncogenes, such as ras,
are oncogenic when mutated. Other oncogenes, such as myc, are oncogenic
when overexpressed or underexpressed. Many oncogenes represent members
of multigene families or homologous families. Homologs are proteins that
have similar primary, secondary or tertiary structures. Genes may differ in
nucleic acid sequence or encoded peptide sequence and still be homologs
when the encoded polypeptides have similar spatial folding. Many
oncogenes can be classified into dominant oncogenes and recessive
oncogenes. One or more dominant oncogenes can confer a neoplastic or
pre-neoplastic phenotype to a cell. One or more recessive oncogenes, when
silenced, may also confer a neoplastic or pre-neoplastic phenotype. Gene
silencing is performed by transfecting cells with nucleic acids which cause
genetic ablation or by antisense suppression.

While any oncogene may be used, the preferred oncogenes are
those that are normally associated with metastasis such as a metastasis
specific gene. Such genes include, for example, TGF-β1, cyclin D1, p21,
p34, p53, lysyl oxidase, caveolin, actin binding protein, ubiquitin activating
enzyme E1, nmb or α-actinin 3. Metastatic-specific genes may be used
individually or in combination with other oncogenes.

The metastatic potential of a cell may be altered, for example,
by gene ablation with a sequence specific for a recessive oncogene.
Recessive oncogenes are those genes which encode products which can
suppress oncogenesis and metastasis. A gene ablation sequence can be
designed to specifically suppress a recessive oncogene. Ablation may
include pre-transcriptional inhibition such as homologous recombination
with endogenous recessive oncogenes and post-transcriptional inhibition
such as the expression of antisense oncogenes to suppress translation. Gene
ablation sequences may be targeted towards well known recessive oncogenes
such as, for example, the retinoblastoma gene (Rb) or Beg. Other candidates
for ablation include metastatic genes previously isolated by the invention
such as, for example, TGF-β1, cyclin D1, p21, p34, p53, lysyl oxidase,
caveolin, actin binding protein, ubiquitin activating enzyme E1, nmb or
α-actinin 3. The effects of ablating a recessive oncogene may include
oncogenesis and metastases.

Alternatively, or in addition to transfection, the mammalian cell
may be treated with an agent, either before or after transfection, that alters
the expression of the cell's nucleic acids. Treatment may comprise
contacting the cells with one or more agents which affect the neoplastic (e.g.
neoplastic agents; phorbol esters), metabolic (e.g. metabolic agents),
metastatic (e.g. metastatic agents), differentiation (e.g. differentiation agents;
retinoic acid), activation or proliferation (e.g. growth factors) state of the
cell. Agents which can alter gene expression include chemicals such as
benzanthracene (BA), dimethyl benzanthracene (DMBA) or 5-azacytidine.
Alternatively, treatment may also comprise altered conditions such as
hypoxia, which involves subjecting a cell to a reduced oxygen content,
exposure to radiation or other stresses to the cell.

Treatment may be in vitro or in vivo and may include, for
example, direct or indirect induction or suppression of well known oncogenic
sequences and genes isolated by the invention such as, for example, TGF-β1,
cyclin D1, p53, lysyl oxidase, caveolin, actin binding protein, ubiquitin
activating enzyme E1, nmb, α-actinin 3, and p34. Gene expression induction
includes transfecting expression vectors encompassing coding regions of the
gene. Gene repression comprises introducing a gene ablation sequence or
a repressor of the gene to the cell.

Cells which have one or more genes ablated may also be used.
For example, a metastatic suppressor gene may be ablated to prevent
inhibition of metastasis. A useful gene for ablation is a gene capable of
affecting the phenotype and behavior of a cell or tumor. For example, with
prostate tumors, suitable genes include both well known genes and genes
isolated by the methods of the invention such as, for example, TGF-β1,
cyclin D1, p21, p34, p53, lysyl oxidase, caveolin, actin binding protein,
ubiquitin activating enzyme E1, nmb and α-actinin 3. Genetic ablation (gene
knockout) refers to a process of silencing the expression of a particular gene
in a cell. The silencing process may include, for example, gene targeting or
antisense blocking. Gene targeting refers to a process of introducing a
nucleic acid construct into a cell to specifically recombine with a target gene.
The nucleic acid construct inactivates the targeted gene. Inactivation may
be by introduction of termination codons into a coding region or introduction
of a repression site into a regulatory sequence. Antisense blocking refers to
the incorporation into a cell of expression sequences which direct the
synthesis of antisense RNA to block expression of a target gene. Antisense
RNA hybridizes to the mRNA of the target gene to inhibit expression.

The host animal is preferably the same species as the
implanted cell. In cases of xenogeneic transplants, the host may be
immunocompromised genetically or by treatment with drugs such as
immunosuppressants. A host may be immunocompromised genetically by
breeding such as with nude mice or severe combined immunodeficient
(SCID) mice. A host may also be immunocompromised by chemical or
irradiation methods. An additional route to immunocompromise a host is to
use transgenic technology to introduce an immunosuppressing gene or to
introduce a foreign antigen gene. An immunosuppressing gene is a gene that
affects the efficiency of the immune system such as a gene which inhibits the
formation of cells of the B cell or T cell lineage. A foreign antigen gene,
when expressed, may cause the host to tolerate the antigens in a xenogeneic
transplant and not mount an immune response.

Cells may be implanted into any primary site in a host animal
by, for example, subcutaneous implantation, intravenous injection or
implantation into the abdominal, cardiac, chest, pulmonary, thoracic or
peritoneal cavity. Using techniques known to those of ordinary skill in the
art, cells can be placed on or in nearly any organ or tissue. Reasons for
choosing a site include ease of implant, proximity of similar tissue type,
immunoprivileged position and ease of inspection. Metastases migrate
from the primary site to one or more secondary sites such as, for example,
the lung, kidney, liver, lymph nodes, brain, testis, bone, spleen, ovaries or
mammary tissue. Preferred sites include the renal capsule, the testes, the
prostate and the ovaries.

To avoid histocompatibility problems, the implant may be
placed into a histocompatible host animal. Such problems are generally
avoided if the host animals are syngeneic. Alternatively, a
non-histocompatible host may be used if the host can be made immunotolerant.
Hosts may also be transgenic or immunocompromised animals or genetically
matched to the mammalian cells to be introduced. Immunocompromised
animals may be derived from established mouse lines such as nude mice or
severe combined immune deficiency (SCID) mice, or by treatments such as
radiation, chemical, pharmaceutical or genetic targeting. Sufficiently
immunosuppressed animals can be made tolerant to xenogeneic transplants.

After implantation the host animal is maintained under normal
conditions to develop metastases. Alternatively, the host animal may be
subjected to an altered treatment or environmental condition to stimulate or
repress metastasis or induce other cellular functions. In metastasis, a
subpopulation of cells of the implantation site invade and establish one or
more secondary colonies in the host animal. The behavior of the implanted
cell will depend on the cell type, the transfected sequence and the
implantation location. Typical secondary sites for metastatic colonies include
lung, kidney, liver, lymph nodes, brain, testis, spleen, bone, ovary, skin and
mammary tissue. Metastatic development times vary from days to weeks or
even months. Cells with a high metastatic potential tend to progress to
metastasis quickly while cells with a low metastatic potential may require
very long periods of time that span significant portions of the lifespan of the
animal.

The host animal may be analyzed for metastatic development
weekly, from one week to 20 weeks, six months, nine months or one year
after implantation. For animals with longer lifespans such as sheep, the
animal may be inspected yearly from one year on up to ten years for
metastatic tumors. Metastases can be detected by examinations such as
palpation, biopsy, imaging, exploratory surgery, CAT scans, autopsy, X-ray
and direct observation. In addition, tissue samples may be taken surgically
from the host mammal and subjected to histological or other examination for
the detection of metastases.

Expressed sequences include mRNA, rRNA, hnRNA, DNA,
cDNA and any nucleic acid sequence that is expressed in the cell. These
sequences may be amplified by in situ techniques or by purification of
nucleic acid from collected cells. Expressed sequences may be obtained by
extracting nucleic acids from cells before implantation, at the primary site
or at the secondary site. Cells collected at these sites may optionally be
cultured for a time before nucleic acid extraction. The effects of treatment
with gene expression modifying agents or environmental conditions can be
ascertained by collecting cells before and after treatment. Treatment may be
applied to the cells while the cells are in the host mammal or after the cells
are excised and in culture. Nucleic acids are collected from cells using
techniques that are well known to those of ordinary skill in the art.

Expressed sequences may be used directly for polymerase
chain reaction (PCR) analysis using, for example, the technique of reverse
transcriptase polymerase chain reaction (RT-PCR). Alternatively, RNA may
be enriched for mRNA using a poly-A RNA enrichment method. Numerous
poly-A RNA enrichment methods exist and are commercially available.
Techniques used for poly-A RNA enrichment include oligo-dT columns,
oligo-dT magnetic beads, and oligo-dT cellulose. RNA may be further
processed into cDNA before analysis by reverse transcription using reverse
transcriptase. The cells or the extracted nucleic acid may be preserved, such
as by freezing, and analyzed at a later time.

Differential display polymerase chain reactions (DD-PCR) are
performed on the expressed sequences using two variable primers, which
may contain the same or entirely different sequences, or an anchor primer
and a variable primer. If an anchor primer is used, one anchor primer and
one variable primer create a single product or a single set of reaction
products for each reaction. A complete profile may include 25 or more
different PCR reactions per sample wherein each PCR reaction is performed
with the same anchor primer and a different variable primer. DD-PCR may also
be performed using anchor and variable primers which contain the same
sequence. Whether a particular reaction is used depends on whether a
difference exists between the products of two PCR reactions using the same
primers. When a significant difference exists between the expression
sequences amplified, one pair of PCR reactions may be sufficient and
informative.

Anchor primers are preferably oligonucleotides with a poly-T
sequence at the 5'-terminus and a dinucleotide selected from the group
consisting of AA, AG, AC, AT, GA, GG, GC, GT, CA, CG, CC and CT at
the 3'-terminus. For example, the sequence may be 5'-TTTTTTAA-3' or
5'-TTTTTTAG-3'. The length of the poly-T sequence is typically between
about 5 to about 30 bases and preferably between about 10 to about
20 nucleotides. The total length of the anchor primer can vary greatly
for each experiment but is preferably between about 7 to about 32 and more
preferably between about 12 and about 22 bases. Differential display
polymerase chain reaction may also be performed using an anchor primer of
any sequence and a length between about 5 to about 30, preferably between
about 5 to about 20 and more preferably between about 7 to about 12 bases.
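
As an illustrative sketch only, the set of anchor primers described above (a poly-T run followed by one of the twelve listed 3'-terminal dinucleotides) can be enumerated programmatically. The function name anchor_primers and its default poly-T length of 12 are assumptions made for this example and are not part of the disclosed method.

```python
from itertools import product

# The twelve 3'-terminal dinucleotides named in the text: the first base is
# A, G or C and the second base is any of A, G, C or T.
DINUCLEOTIDES = [first + second for first, second in product("AGC", "AGCT")]


def anchor_primers(poly_t_length=12):
    """Return the 12 anchor primers (written 5'->3') for one poly-T length.

    poly_t_length is a hypothetical parameter; the text gives a typical
    range of about 5 to about 30 bases for the poly-T sequence.
    """
    if not 5 <= poly_t_length <= 30:
        raise ValueError("poly-T length outside the range of about 5 to about 30")
    return ["T" * poly_t_length + dinucleotide for dinucleotide in DINUCLEOTIDES]


if __name__ == "__main__":
    # With a poly-T length of 6 the first two primers are the two examples
    # given in the text, TTTTTTAA and TTTTTTAG.
    for primer in anchor_primers(6):
        print("5'-" + primer + "-3'")
```

Each returned primer is two bases longer than its poly-T run, which agrees with the stated total lengths of about 7 to about 32 bases.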

The variable primer may comprise a random sequence, or a 
specific sequence such as, for example, a sequence of SEQ ID NO. 1 to SEQ 
ID NO. 24. Variable primers preferably are oligonucleotides with a length 
between about 5 to about 30, preferably between about 5 to about 20, and 
more preferably between about 7 to about 12 bases in length. 

To enhance detection of the PCR product, the anchor primer
or the variable primer, or both, may comprise a detectable moiety. Examples
of detectable moieties include radioactive moieties, phosphorescent moieties,
magnetic moieties, luminescent moieties, conjugatable moieties or other
detectable moiety. A plurality of detectable moieties may be used to
enhance detection or to simplify data analysis. Other detectable moieties
include conjugatable moieties and molecules which can bind specifically to
other molecules which are themselves detectable. Examples of conjugatable
moieties include avidin, streptavidin, biotin, antibody, antigen, cell adhesion
molecules and other molecules with similar activities. Detectable moieties
are preferably labeled nucleotides. A nucleotide may be any natural or
synthetic nucleotide or nucleotide analog capable of incorporation into an
elongation reaction in a polymerase chain reaction. Labeled nucleotides
include nucleotide triphosphates labeled with one or more radioactive atoms
such as 32P, 33P, 3H, 14C and 35S.

Products of DD-PCR reactions are compared to detect the
metastatic sequence. Comparisons can be performed between expressed
sequences from cells at secondary sites with cells at any stage in the method
including untreated mammalian cells, transfected or treated mammalian
cells, implanted cells or cells obtained from the primary site in the host
animal. DD-PCR products may be analyzed by any method which reliably
compares the products of two polymerase chain reactions. Typical analytical
methods used for this purpose include polyacrylamide gel electrophoresis,
capillary electrophoresis and high pressure liquid chromatography (HPLC).
Product produced from DD-PCR may be analyzed in double-stranded or
single-stranded forms. When the products of the DD-PCR reaction are
labeled, the sizes and distribution of the products may be monitored and
analyzed by following the labels using a radiation monitor or by
autoradiography. For example, DD-PCR performed in the presence of
radioactive primers or nucleotide triphosphates can be analyzed by gel
electrophoresis, by capillary electrophoresis, or by HPLC. Products are
easily monitored by the presence of radioactivity.

Another method for analyzing and isolating metastatic
sequences is to sequence the amplified nucleic acid sequences. Sequencing
may be performed using standard methods well known to those of ordinary
skill in the art. The resulting sequence may be compared to a sequence
database, whether newly created or well known, such as GenBank, for
identification or for locating homologs. The sequencing information may be
used to calculate the physical characteristics of the nucleic acids such as
melting temperature and secondary structure. The primary sequence and the
physical characteristics may be used to synthesize optimal nucleic acid probes
for the detection or staging of metastasis or conditions that are predictive
of the presence or absence of the metastatic condition.
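
By way of a hedged example, a melting temperature of the kind mentioned above can be estimated for short oligonucleotides with the Wallace rule, Tm = 2 x (A+T) + 4 x (G+C) in degrees Celsius. The text does not state which calculation is intended, so the choice of this rule and the function name wallace_tm are assumptions; the rule is only a rough approximation for oligonucleotides shorter than about 14 bases.

```python
def wallace_tm(sequence):
    """Estimate the melting temperature (degrees C) of a short DNA oligo.

    Uses the Wallace rule: 2 degrees per A or T plus 4 degrees per G or C.
    This rule is an assumption for illustration; the source does not name
    a specific melting temperature calculation.
    """
    sequence = sequence.upper()
    unexpected = set(sequence) - set("ACGT")
    if unexpected:
        raise ValueError("unexpected characters: " + "".join(sorted(unexpected)))
    at_count = sequence.count("A") + sequence.count("T")
    gc_count = sequence.count("G") + sequence.count("C")
    return 2 * at_count + 4 * gc_count


if __name__ == "__main__":
    # The 10-base primer of SEQ. ID. NO. 9 has 5 A/T and 5 G/C bases.
    print(wallace_tm("TGGTAAAGGG"))
```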

Another embodiment of the invention is directed to a method
for identifying a metastatic sequence. A mammalian cell is pretreated with
a metastatic agent to form a population of cells predisposed to metastasize.
The treated cells are introduced into a host mammal at a primary site. The
host animal is maintained for a period of time sufficient to develop a
metastasis at a secondary site. Cells at the primary
site and cells at the secondary site are treated with a genotoxic agent or
subjected to genotoxic conditions. Expressed sequences of the treated cells
are amplified by differential display polymerase chain reaction and
compared with untreated cells from any previous step to identify the
metastatic sequence.

The metastatic agent may be a chemical compound, a nucleic
acid or a protein that alters the metastatic potential of a cell or relates to or
is associated with the metastatic process. Chemical compounds include
retinoids such as 4-hydroxyphenyl (4HP). Other agents include the proteins
TGF-β1, cyclin D1, p21, p34, p53, lysyl oxidase, caveolin, actin binding
protein, ubiquitin activating enzyme E1, nmb or α-actinin 3, or their
respective genes. The metastatic agent may be a metastatic stimulant or a
metastatic suppressant. Metastatic stimulants may be used to enhance the
sensitivity of the metastasis sequence detection method. Conversely,
metastatic suppressants may be used to decrease the sensitivity of the
method, enabling the selective identification of potent metastatic sequences
or sequences specific to a particular tissue type or metastatic disorder.
Treatment may comprise direct contact with the metastatic agent or
incubation for a period of time. Metastatic agents enhance the metastatic
potential of the implanted cells and increase the sensitivity and the speed of
the overall method.

The cells at the primary site and the metastatic cells at the
secondary site may be treated with a genotoxic agent in vivo or in vitro. In
vivo treatment may comprise injecting genotoxic agents directly into the host
mammal or specifically applying the agent with, for example, topical
formulations. The cells at the primary site and the secondary site may also
be isolated from the host animal and treated with the genotoxic agent in
culture. Genotoxic agents are chemical compounds, nucleic acids or
proteins that alter gene expression by affecting the nucleic acid genome
directly by, for example, chemical modification, or indirectly by, for
example, altering components associated with gene expression. Such agents
include, for example, benzanthracene (BA), dimethyl benzanthracene
(DMBA) and 5-azacytidine, and may include metastatic agents as well. In
addition to or in place of genotoxic agents, the cells may be subjected to
hypoxic conditions or radiation to alter gene expression. Metastatic
sequences identified in these methods may be specific for particular
genotoxic agents or conditions.

Another embodiment of the invention is directed to the use of
a host animal with an altered genotypic or phenotypic predisposition for
metastases. A host animal may be screened for endogenous expression of
a metastatic gene. Examples of metastatic sequences which may be screened
for include sequences isolated by the method of the invention, such as, for
example, the sequences listed in Figure 12 and Figure 13. Particularly useful
metastatic sequences include TGF-β. A host animal with reduced levels of
a metastatic gene product may be used to isolate novel metastatic genes.
Host animals may be screened for reduced levels of metastatic gene
expression. In addition, transgenic technology may be used to ablate a
metastatic gene in the germline of a host animal.

Another embodiment of the invention is directed to analysis of
a cell line before its use as a starting material to isolate metastatic genes
in a particular pathway. Analysis is useful in identifying cells, and
consequently sequences specific to these cells, which are particularly
susceptible or resistant to metastatic transformation. For example, a cell
highly predisposed to metastases may be especially sensitive for detecting
metastatic genes. Conversely, a cell showing high resistance to metastasis
can be used to isolate especially potent metastatic sequences. One method
to analyze susceptibility to metastasis is to determine the cellular response
to growth factors or growth inhibitors. Briefly, a control population and a
test population of cells are exposed to a growth factor or a growth inhibitor
and the cellular response (e.g. proliferation, metabolism) recorded. Cells
showing abnormal responses to the growth factor or growth inhibitor may be
used as the starting material for metastatic gene isolation. Cellular responses
include changes in the rate of cellular division (e.g. thymidine uptake),
changes in the expression of RNA or proteins, changes in cellular
localization or modification patterns of RNA or proteins, and changes in the
rate of uptake, release or metabolism of nutrients.

Especially potent or weak metastatic genes may be detected by
treating and analyzing the metastatic potential of different cells and selecting
a suitable cell type as the starting material. For example, cells may be
treated with myc, ras, p53 or combinations thereof and analyzed for cyclin
D1 expression, which is shown to correlate with metastasis. Figure 2 shows
the in situ analysis of cyclin D1 in primary MPR tumors (Figure 2A) and in
metastatic deposits from the lung of the same animal (Figure 2B). The gene
expression pattern of cyclin D1 in MPR correlates with that of human
prostate tumors (Figure 3) analyzed with stains specific for cyclin D1
expression. Normal human tissue shows no cyclin D1 expression or
staining (Figure 3A). Moderately differentiated prostate cancers with
dispersed (Figure 3B) or focal (Figure 3C) positive staining show moderate
staining. Advanced poorly differentiated prostate cancers show strong
nuclear as well as cytoplasmic staining (Figure 3D), implying strong
expression of cyclin D1. After treatment with myc, ras or p53, cyclin D1
expression shows correlation with the metastatic potential of the cell. Thus,
cyclin D1 expressing cells are a source of cells with high metastatic
potential. Conversely, cells with low cyclin D1 expression are a source of
potentially metastatically resistant cells.

This method may be adjusted for the isolation of metastatic
sequences expressed along a particular developmental or differentiation
pathway by combining the various treatment and analytical techniques. This
approach is schematically represented in Figure 4. For example, a
mammalian cell may be genetically ablated for TGF-β1, cyclin D1, p53,
lysyl oxidase, caveolin, actin binding protein, ubiquitin activating enzyme
E1, nmb, α-actinin 3 or p34. The genetically altered cell is used in an in vivo
mouse prostate reconstitution (MPR) model. Metastatic and nonmetastatic
cells isolated from the MPR may be analyzed directly or after induction with
an agent such as the TGF-β gene or its product. Analysis involves the use
of differential display polymerase chain reaction to identify differentially
expressed bands. Sequences identified may be used for subsequent ablation,
transformation or differential analysis.

Genetic ablation (gene knockout) may be performed after a cell
is selected or by selecting a cell comprising a genotype with the proper
genetic ablation. Cells already comprising gene ablation may be acquired
from a cell depository, from other laboratories or from a transgenic animal.
As transgenic animals comprise genetically ablated genes in every cell, any
tissue from a transgenic animal may be used as the starting material.

The effects of oncogenes are at least additive and often
synergistic. Thus, dominant oncogenes may be transfected together or
multiple recessive oncogenes ablated together for a stronger effect.
Furthermore, both methods may be combined and dominant oncogene
transfection may be accompanied by recessive oncogene ablation.

The function of the metastatic sequence may be determined by
the differential expression pattern. For example, a dominant metastatic gene
will be present in a metastatic cell while a recessive metastatic gene is
present in a non-metastatic cell. Metastatic sequences may be detected as
bands which are present in the DD-PCR of metastases isolated in secondary
sites and absent from DD-PCR products of primary cells. These sequences
may be dominant metastatic genes whose expression is directly responsible
for metastases, or they may be metastasis associated genes whose expression
correlates with metastasis. Either is useful for therapy and diagnosis.
Conversely, DD-PCR bands which are present in primary site tumors, but
absent in secondary metastatic sites, may be dominant metastasis
suppression genes. Dominant metastasis suppression genes comprise genes
whose expression suppresses metastasis while nonmetastatic genes comprise
genes whose expression correlates with non-metastatic tissue. Genes which
are highly correlative with either the metastatic phenotype or the
non-metastatic phenotype may be isolated. Isolation can be performed by cutting
the appropriate nucleic acid in the band of a polyacrylamide gel or by
collecting the appropriate fraction from an HPLC or capillary electrophoresis
run. The nucleic acid may be cloned into a plasmid vector and sequenced, or
synthetically prepared.
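
The band comparison described in this passage can be sketched, under stated assumptions, as a set comparison between the bands seen in a primary-site lane and those seen in a secondary-site lane for the same primer pair. The function name classify_bands, the category labels and the band sizes in the usage example are hypothetical and serve only to illustrate the logic; real gels additionally require a tolerance when matching band positions.

```python
def classify_bands(primary_bands, metastatic_bands):
    """Sort DD-PCR bands into candidate classes by presence or absence.

    Bands are identified here by an apparent size in base pairs, which is
    an assumption of this sketch.
    """
    primary = set(primary_bands)
    metastatic = set(metastatic_bands)
    return {
        # Present at the secondary site only: candidate dominant metastatic
        # or metastasis associated sequences.
        "metastatic_candidates": sorted(metastatic - primary),
        # Present at the primary site only: candidate metastasis
        # suppression sequences.
        "suppressor_candidates": sorted(primary - metastatic),
        # Present in both lanes: not informative for this comparison.
        "shared": sorted(primary & metastatic),
    }


if __name__ == "__main__":
    # Hypothetical band sizes for one primer pair.
    result = classify_bands([210, 340, 515], [210, 425, 515, 610])
    for label, sizes in result.items():
        print(label, sizes)
```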

Another embodiment of the invention is directed to a method
for identifying sequences in a metastatic pathway which are responsive or
unresponsive to extracellular signals. Such sequences may be used in
therapy and diagnosis of metastatic disorders. Implanted cells or cells from
a primary site and cells from a secondary site are treated with extracellular
signals. RNA sequences from the treated cells are compared with RNA
sequences of the untreated cells (Figure 5B). Treated cells and untreated
cells may be derived from a short term or long term in vitro culture of
primary tumor and malignant tumors. Alternatively, a part of a primary
tumor and a part of a malignant tumor may be collected before the animal is
treated with an extracellular cytokine or other factor. Long term cultures, or
cell lines of primary and malignant cells, may also be used as recipients of
extracellular growth signal treatment. Suitable signals for each experiment
will depend on the cell type. Generally, growth factors, lymphokines,
inhibitory factors, migratory factors or hormones may be used. Factors
previously isolated by commercial methods or methods of the invention and
factors associated with or causative or suppressive of metastasis are preferred.
Thus, transforming growth factor β1 (TGF-β1) may be used to treat cells
before DD-PCR analysis. Proteins encoded by the genes isolated by this
method are especially useful for the treatment of cells for the isolation of
additional sequences. The identification of one sequence responsive to the
extracellular signal pathway allows for identification of additional genes
upstream and downstream from that sequence.

Another embodiment of the invention is directed to metastatic
sequences identified by the methods of the invention. Metastatic sequences
are sequences associated with the presence or absence of a metastasis or
related to the metastatic process or can be used in the therapeutic treatment
of metastasis. Metastatic-related sequences include dominant metastatic
sequences, recessive metastatic sequences, metastasis associated sequences,
dominant oncogenes, recessive oncogenes and cell cycle genes. These genes
encode, for example, proteins involved in cell cycle, signal processing, DNA
replication, growth regulation, inter- and intracellular signaling,
transcription control and translation control. Isolated sequences are useful
in the treatment and for the detection of metastatic and other disorders.
Disorders which may be treated comprise diseases involving proteins and
sequences which are isolated by interaction with the sequences and proteins
isolated by the method of the invention. Both malignant and nonmalignant
disorders may be treated. Nonmalignant disorders include hyperplasia,
dysplasia and hypertrophy. Examples of nonmalignant disorders include benign
enlargement of the prostate, nodular hyperplasia, and benign prostatic
hypertrophy.

Treatment may involve gene replacement, gene targeting,
antisense inhibition, gene expression or gene suppression. Gene replacement
involves replacing a copy of a defective gene with another copy by
homologous recombination. Gene targeting involves the disruption of a
cellular copy of a gene by homologous recombination. Antisense inhibition
exploits the specificity of hybridization reactions between two
complementary nucleic acid chains to suppress gene expression. Cloned
genes can be engineered to express RNA from only one or the other DNA
strand. The resultant RNA hybridizes to the sense RNA and inhibits gene
expression. Gene expression and gene suppression involve the introduction
of genes whose expression actively inhibits neoplastic transformation and
metastasis.

Another embodiment of the invention is directed to nucleic
acids which comprise a sequence identified by the methods of the invention.
The nucleic acid may be DNA, RNA or PNA and may be used as a
diagnostic tool in the treatment of neoplastic disorders and malignant tumors.
The nucleic acids may comprise additional sequences such as promoters, for
expression of a sense or antisense message, recombination sequences for
gene targeting, selectable markers for transfections, or replication origins for
passage in a prokaryotic or eukaryotic host such as animal cells, bacteria or
yeast.

Another embodiment of the invention is directed to nucleic
acids which comprise sequences identified by the method of the invention
such as, for example, the caveolin gene, the ABP280 (actin binding protein
280) gene, the lysyl oxidase gene, and the nmb gene (clone 29), and other
sequences listed in Figure 12 and Figure 13. Nucleic acids comprising a
sequence corresponding to these genes may be used in treatment or diagnosis
and in diagnostic kits for screening biological samples for the presence or
absence of metastasis or metastatic potential. Treatment may involve using the
sequences in gene therapy, including gene ablation, gene expression and
antisense suppression. Diagnosis may involve genotypic analysis of samples
to determine the existence and expression levels of the expressed sequences.

Another embodiment of the invention is directed to the use of
caveolin gene and protein in the isolation of oncogenes and in the treatment
of neoplastic disorders such as, for example, prostate cancer. Caveolin is an
integral membrane protein and a principal component of caveolae. Caveolae
are small invaginations at or near the plasma membrane of most smooth
muscle cells and may function as a component of specific signal transduction
pathways. Surprisingly, caveolin expression increases in metastatic human
prostate cells as compared to human primary prostate tumors.

As caveolin expression correlates with metastasis, application
of biological technologies designed to block the activity of caveolin or the
function of caveolae may have therapeutic benefits for the treatment of
neoplastic disorders such as human prostate tumors. Specific treatment
approaches using caveolin may include the delivery of antisense or dominant
negative caveolin sequences using expression or viral vectors, as well as the
use of specific anti-caveolin antibodies. Additional approaches could also
target the caveolae, but are not specifically based on caveolin function.
Additional protein and non-protein components of caveolae could also be
targeted for abrogation, or the local or systemic administration of nutritional
or biological agents may also be used. For example, caveolae are extremely
rich in cholesterol and disruption or depletion of this molecule may alter the
function of caveolae.

Another embodiment of the invention is directed to methods
for treating a neoplastic disorder comprising administering a
pharmaceutically effective amount of a composition containing a nucleic acid
having a sequence identified according to the methods of this invention, its
expression product or fragments of either. The nucleic acid may be in the
form of a sense or antisense single-stranded or double-stranded nucleic acid.
The composition may be combined with a pharmaceutically acceptable
carrier such as water, alcohols, salts, oils, fatty acids, saccharides or
polysaccharides, and administered by injection, pulmonary absorption, topical
application or delayed release. More than one carrier may be used together
to create a pharmaceutical with desirable properties.

Another embodiment of the invention is directed to a kit or
diagnostic aid for screening biological samples for detection of metastasis
or neoplasia. Kits comprise sequences isolated according to the methods of
the invention and reagents and materials useful in such kits, such as, for
example, buffers, salts, preservatives, and carriers, all of which are well
known to those of ordinary skill in the art. Kits are useful for the analysis
of tissues to screen those for the determination of normal, nonmalignant,
neoplastic or malignant cells. Kits may comprise additional reagents useful
for the extraction of nucleic acids from a tissue sample. Reagents for
analyzing the nucleic acid extracted from a tissue sample, such as polymerase
chain reaction reagents and Southern blot reagents, may also be included.

The following experiments are offered to illustrate 
embodiments of the invention and should not be viewed as limiting the scope 
of the invention. 



Examples 

Example 1 Production of Mouse Prostate Reconstitution Tumors and
Metastasis.

Mouse Urogenital Sinus (UGS) tissue was isolated from
17-day-old mouse embryos. Each isolated UGS was digested with 1% trypsin for
three hours at 4°C. The trypsin was inactivated by the addition of fetal calf
serum. UGS cells were digested with 0.125% collagenase for 1.5 hours,
counted and mixed at the appropriate cell ratios prior to infection with
retrovirus in the presence of polybrene. Retroviruses used include
Zipras/myc-9. Control experiments were performed using BAGα virus.
After a two-hour infection, the infected cells were centrifuged and individual
reconstitutions containing 1.5 x 10^6 cells were produced by resuspending the
cells in rat tail collagen at a density of 6.0 x 10^7 cells per ml. Aliquots of
the infected UGS cells were placed in DME with 10% fetal calf serum
overnight at 37°C, 5% CO2.
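
As a worked check of the figures in this paragraph, the collagen volume of one reconstitution follows from the cell number divided by the cell density. The sketch below assumes that the cell count reads 1.5 x 10^6 and the density 6.0 x 10^7 cells per ml, which is an interpretation of the scanned exponents; the function name reconstitution_volume_ul is hypothetical.

```python
def reconstitution_volume_ul(cell_count, cells_per_ml):
    """Return the suspension volume in microliters holding cell_count cells."""
    if cells_per_ml <= 0:
        raise ValueError("cell density must be positive")
    volume_ml = cell_count / cells_per_ml
    return volume_ml * 1000.0  # 1 ml = 1000 microliters


if __name__ == "__main__":
    # Assumed readings of the scanned values: 1.5e6 cells at 6.0e7 cells/ml.
    print(round(reconstitution_volume_ul(1.5e6, 6.0e7), 3), "microliters")
```

Under these assumed readings each reconstitution corresponds to roughly 25 microliters of collagen suspension.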

The next morning each cell/collagen reconstitution was
implanted under the renal capsule of an adult male +/+ animal.
Reconstitutions were harvested from the mice five weeks later when they
showed signs of obvious distress from the tumor burden. Metastasized
tumors were isolated from the same mice at sites outside the renal capsule.
Isolated tumors and metastases were either stored in liquid nitrogen or in
preservatives such as 10% buffered formalin.

Cell lines were derived from fresh tumors by mincing a small
portion of the primary metastatic or nonmetastatic tumor and placing each
in explant culture in Dulbecco's Modified Eagle Medium (DMEM)
supplemented with 10% fetal calf serum. Cells which grew from each
explant were propagated in DMEM and 10% fetal calf serum.

For histological analysis, a portion of a fresh tumor was fixed
in 10% buffered formalin and embedded in paraffin for sectioning and
staining with hematoxylin and eosin (H&E) or immunohistochemical
staining. Immunohistochemical localization of cytokeratins was detected
using polyclonal cytokeratin antiserum A575 (Dako Co.; Carpinteria, CA)
and Vectastain ABC kit (Vector Laboratories; Burlingame, CA).

Example 2 Isolation of c-DNA for DD-PCR

Total cellular RNA was isolated by ultracentrifugation through
cesium chloride. Briefly, [illegible] of tissue from [illegible]
organs was placed into 4 ml of ice-cold GIT buffer (4 M guanidine
isothiocyanate, 0.025 M sodium acetate, 0.1 M β-mercaptoethanol) and
homogenized in a tissue homogenizer (Polytron or equivalent). The
homogenate was carefully layered over 4 ml of 5.7 M CsCl, 0.024 M sodium
acetate ([illegible] g CsCl per ml) in a centrifuge tube. The layers were
centrifuged at [illegible] RPM for [illegible] hours in a [illegible] rotor.
[Illegible] was collected from the interface between the cushion and the
supernatant, diluted two fold with water, added to 2[illegible] volumes of
ethanol and spooled out on a glass rod. RNA that formed a pellet on the
bottom of the CsCl layer was resuspended, [illegible] chloroform and
precipitated [illegible] diethylpyrocarbonate treated water. The
concentrations of DNA and RNA were determined by absorption at 260
nanometers.
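
For illustration, absorbance at 260 nanometers is commonly converted to a concentration with the standard approximations that one absorbance unit corresponds to about 40 micrograms per ml of RNA or about 50 micrograms per ml of double-stranded DNA in a 1 cm path. The example does not state which conversion factors were used, so the factors, the function name and the dilution parameter below are assumptions.

```python
# Conventional conversion factors (micrograms per ml per A260 unit, 1 cm path).
# These are standard approximations and are not taken from the source text.
A260_FACTORS = {"RNA": 40.0, "dsDNA": 50.0}


def concentration_ug_per_ml(a260, nucleic_acid="RNA", dilution_factor=1.0):
    """Convert an A260 reading of a diluted sample to the stock concentration."""
    if nucleic_acid not in A260_FACTORS:
        raise ValueError("nucleic_acid must be one of " + ", ".join(A260_FACTORS))
    if a260 < 0 or dilution_factor <= 0:
        raise ValueError("absorbance must be >= 0 and dilution factor > 0")
    return a260 * A260_FACTORS[nucleic_acid] * dilution_factor


if __name__ == "__main__":
    # A hypothetical 1:10 dilution of an RNA preparation reading 0.5 at 260 nm.
    print(concentration_ug_per_ml(0.5, "RNA", dilution_factor=10.0))
```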
Example 3 Differential Display Polymerase Chain Reaction

mRNA isolated from primary tumors or metastases was reverse
transcribed with one of the primers and subjected to DD-PCR using the same
primer as both the forward and reverse primer. A set of 24 primers
comprising short oligonucleotides was used for both the reverse
transcription of mRNA into c-DNA and for differential display polymerase
chain reaction. The sequences of the primers used are shown in Table 1.

Table 1

Primer No.   Sequence                 Sequence number
 1           [illegible in source]    (SEQ. ID. NO. 1)
 2           [illegible in source]    (SEQ. ID. NO. 2)
 3           [illegible in source]    (SEQ. ID. NO. 3)
 4           [illegible in source]    (SEQ. ID. NO. 4)
 5           [missing in source]      (SEQ. ID. NO. 5)
 6           5'-TGGATTGGTC-3'         (SEQ. ID. NO. 6)
 7           5'-CTTTCTACCC-3'         (SEQ. ID. NO. 7)
 8           5'-GGAACCAATC-3'         (SEQ. ID. NO. 8)
 9           5'-TGGTAAAGGG-3'         (SEQ. ID. NO. 9)
10           5'-TCGGTCATAG-3'         (SEQ. ID. NO. 10)
11           5'-CTGCTTGATG-3'         (SEQ. ID. NO. 11)
12           5'-GATCAAGTCC-3'         (SEQ. ID. NO. 12)
13           5'-GATCCAGTAC-3'         (SEQ. ID. NO. 13)
14           5'-GATCACGTAC-3'         (SEQ. ID. NO. 14)
15           5'-GATCTGACAC-3'         (SEQ. ID. NO. 15)
16           5'-TTAGCACCTC-3'         (SEQ. ID. NO. 16)
17           5'-ACCTGCATGC-3'         (SEQ. ID. NO. 17)
18           5'-GCTATACTGC-3'         (SEQ. ID. NO. 18)
19           5'-AGTTGCCAGG-3'         (SEQ. ID. NO. 19)
20           5'-AAGCCGTGTC-3'         (SEQ. ID. NO. 20)
21           5'-TCAACGCTCA-3'         (SEQ. ID. NO. 21)
22           5'-TGTTCGAATC-3'         (SEQ. ID. NO. 22)
23           5'-CGAGTCAGAC-3'         (SEQ. ID. NO. 23)
24           5'-TATGAGTCCG-3'         (SEQ. ID. NO. 24)



PCR was performed using standard conditions with 40 cycles
of denaturation at 94°C for 40 seconds, annealing at 40°C for 2 minutes and
elongation at 72°C for 35 seconds. After PCR, the products were analyzed
with non-denaturing polyacrylamide gel electrophoresis (PAGE) at 12 watts
for 15 hours. Bands which differed between test and control samples were
eluted from the gel, subjected to reamplification by PCR and cloned.
Polyacrylamide gel electrophoresis of DD-PCRs, and the accompanying
RNA blot analysis showing the isolation of sequences with substantial
similarity to nmb and TGF-β, are shown in Figure 6 and Figure 7 respectively.
Additional sequences isolated by this method show substantial similarity to
lysyl oxidase, actin binding protein, ubiquitin activating enzyme E1,
α-actinin, and p34 ribosomal binding protein sequence (Figure 8). Differential
expression of caveolin was demonstrated by DD-PCR followed by PAGE
(Figure 9).
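
The cycling conditions stated above imply a minimum programmed run time, which can be worked out as follows. This is a sketch only: it reads the three steps as 94°C for 40 seconds, 40°C for 2 minutes and 72°C for 35 seconds, and it ignores temperature ramp times and any initial or final holds, none of which are given in the text. The names used are hypothetical.

```python
# Step durations in seconds, as read from the cycling conditions in the text.
CYCLE_STEPS_SECONDS = {
    "denaturation_94C": 40,
    "annealing_40C": 2 * 60,
    "elongation_72C": 35,
}
CYCLES = 40


def programmed_hold_minutes(steps=CYCLE_STEPS_SECONDS, cycles=CYCLES):
    """Return the total programmed hold time in minutes, excluding ramping."""
    seconds_per_cycle = sum(steps.values())
    return cycles * seconds_per_cycle / 60


if __name__ == "__main__":
    # 40 cycles x (40 + 120 + 35) seconds = 7800 seconds.
    print(programmed_hold_minutes(), "minutes")
```

The programmed holds alone therefore account for about 130 minutes per DD-PCR run.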



20 Example 4 nllAUdotwe Determinant 

The p53 allelotype of a cell sample was determined by PCR.
Briefly, nucleic acid is extracted from a tissue sample or a cell culture
sample. An aliquot of nucleic acid is placed in a 45 µl aliquot of a master
mix which contained a final concentration of 0.2 mM of each of dATP, dTTP,
dGTP and dCTP, 1.5 mM MgCl2, 0.5 unit Taq polymerase, and 0.05 µM of each of
a set of two primers specific for the normal wildtype allele of p53
(5'-GTGTTTCATTAGTTCCCCACCTTGAC-3', SEQ. ID NO. 25;
5'-AGAGCAAGAATAAGTCAGAAGCCG-3', SEQ. ID NO. 26). A control
set of primers specific for the fibroblast growth factor-7 gene was used to
monitor the polymerase chain reaction experiment
(5'-ACAGACCGTGCTTCCACCTCGTC-3', SEQ. ID NO. 27;
5'-CCTCATCTCCTGGGTCCCTTTCA-3', SEQ. ID NO. 28). One µl of the
reaction from the first round of PCR was used as the starting material for a
second round of PCR using a second set of wildtype p53 specific primers
(5'-GTCCGCGCCATGGCCATATA-3', SEQ. ID NO. 29;
5'-ATGGGAGGCTGCCAGTCCTAACCC-3', SEQ. ID NO. 30). This second
round of PCR was also monitored using a control set of primers specific for
the fibroblast growth factor-7 gene (5'-ACAGACCGTGCTTCCACCTCGTC-3',
SEQ. ID NO. 27; 5'-CCTCATCTCCTGGGTCCCTTTCA-3', SEQ. ID NO.
28).
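
The final concentrations given for the master mix can be converted to pipetting volumes with the dilution relation C1 x V1 = C2 x V2. The sketch below is only an illustration: the stock concentrations and the 50 µl final reaction volume (45 µl of master mix plus 5 µl of nucleic acid) are assumptions that do not appear in the text, and the function name is hypothetical.

```python
# Final concentrations taken from the text.
FINAL = {"each dNTP (mM)": 0.2, "MgCl2 (mM)": 1.5, "each primer (uM)": 0.05}

# Hypothetical stock concentrations, in the same units as FINAL (assumption).
STOCK = {"each dNTP (mM)": 10.0, "MgCl2 (mM)": 25.0, "each primer (uM)": 10.0}

# Assumption: 45 ul master mix plus a 5 ul aliquot of nucleic acid.
REACTION_VOLUME_UL = 50.0


def stock_volume_ul(final_conc, stock_conc, reaction_volume_ul=REACTION_VOLUME_UL):
    """Volume of stock needed per reaction, from C1 * V1 = C2 * V2."""
    if stock_conc <= final_conc:
        raise ValueError("stock must be more concentrated than the final mix")
    return final_conc * reaction_volume_ul / stock_conc


volumes = {name: stock_volume_ul(FINAL[name], STOCK[name]) for name in FINAL}

for name, vol in volumes.items():
    print(f"{name}: {vol:.2f} ul of stock per reaction")
```

With different stocks or a different reaction volume only the two assumed constants need to change.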

After PCR the products were analyzed with non-denaturing
polyacrylamide gel electrophoresis (PAGE) at 12 watts for 15 hours. Bands
which differed between test and control were eluted from the gel, subjected
to reamplification by PCR and cloned.

Example 5 Induction of Cell Lines with TGF-β1 Influences Cellular Gene
Expression.

148-1PA cells were grown in DME supplemented
with 10% fetal calf serum overnight at 37°C and 5% CO2. Induction was
performed by treatment with TGF-β1 at a concentration of 2 nanograms per
ml. The treated cells were returned to the incubator and cultured for 12
hours. After induction, cells were washed in phosphate buffered saline,
harvested and concentrated by centrifugation.

RNA was extracted from treated and untreated cells and
subjected to DD-PCR. Differentially expressed bands detected by DD-PCR
were cloned and differential expression was confirmed using RNA blots
(Figure 10). Subsequent cloning and sequencing identified the bands as
ABP280 or filamin.

One gene isolated showed differential expression in cells
induced by TGF-β (Figure 11, clone 29), while a control probe on the same
cell line showed no difference in expression levels (Figure 11, GAPDH).

Example 6 Metastatic Sequences Isolated.

Using the methods of Examples 1, 2, 3, 4, and 5, a plurality of
metastatic sequences were isolated and sequenced. The expression of the
metastatic sequences in primary cells and in metastatic cells was determined
using RNA blots. The nucleic acid sequences of other isolated sequences are
listed in Figure 12. Sequence analysis and expression analysis were
performed on the isolated clones and the results of these studies are
summarized in Figure 13.

Example 7 Caveolin Immunoassay in Human Prostate Cancers.

Primary site human prostate tumors and metastases were
isolated and analyzed for caveolin expression by immunoassay. The results
of the assay are shown in Table 3. Metastases show higher levels of caveolin
protein than primary tumors. Immunohistology of tissue
sections reveals both elevated levels and distinct distribution of caveolin
protein in metastatic human prostate when compared to a primary human
prostate tumor (Figure 14).



Table 3

Patients    Primary-site    Metastases in lymph node
1           +               ++
2                           +++
3           ++
4           ++              ++
5           +               +
6           ++              ++
7           ++              +++
8           +               +
9
10          +
11          +               +
12          ++
13          +               +
14          ++              +++



Other embodiments and uses of the invention will be apparent
to those skilled in the art from consideration of the specification and practice
of the invention disclosed herein. The specification and examples should be
considered exemplary only, with the true scope and spirit of the invention
indicated by the following claims.

I Claim:

1. A method for identifying a metastatic sequence comprising the steps
of:

a) transfecting an oncogenic sequence into a mammalian cell to
form a population of transfected cells;

b) introducing transfected cells to a primary site of a host 
mammal; 

c) maintaining said mammal for a period of time sufficient to 
develop a metastasis at a secondary site; 

d) amplifying expressed sequences of the transfected cells and
expressed sequences of the metastasis by differential-display
PCR; and

e) comparing the amplified sequences and identifying the 
metastatic sequence.

2. The method of claim 1 wherein the mammalian cell is transfected by
calcium phosphate transfection, viral transduction, lipofection, dextran 
sulfate transfection or electroporation. 

3. The method of claim 1 wherein the oncogenic sequence is a sequence 
of the gene that expresses the oncoproteins p21, p34, p53, myc, ras or src.

4. The method of claim 1 wherein the oncogenic sequence is a
metastatic sequence.

5. The method of claim 4 wherein the metastatic sequence is a sequence
of the gene that expresses cyclin D1, caveolin or TGF-β1.

6. The method of claim 1 wherein the oncogenic sequence is a gene
ablation sequence specific for the gene that expresses the protein TGF-β1,
cyclin D1, p21, p34, p53, lysyl oxidase, caveolin, actin binding protein,
ubiquitin activating enzyme E1, nmb or α-actinin 3.

7. The method of claim 1 wherein the mammalian cell is treated with a
metastatic agent that alters gene expression before or after transfection.

8. The method of claim 7 wherein the metastatic agent is benzanthracene
(BA), dimethyl benzanthracene (DMBA) or 5-azacytidine.

9. The method of claim 1 wherein the mammalian cell is a primary or 
established cell line. 

10. The method of claim 1 wherein the mammalian cell is derived from
urogenital sinus tissue. 

11. The method of claim 1 wherein the mammalian cell is a fetal cell.

12. The method of claim 1 wherein the mammalian cell contains a
genetically ablated endogenous gene wherein said gene is TGF-β1, cyclin
D1, p21, p34, p53, ras, myc and homologs thereof.

13. The method of claim 1 wherein the mammalian cell is derived from 
the same species as the host mammal. 

14. The method of claim 1 wherein the mammalian cell and the host 
mammal are histocompatible. 

15. The method of claim 1 wherein the mammalian cell and the host
mammal are genetically matched. 

16. The method of claim 1 wherein the transfected cell is maintained in
vivo or in vitro.

17. The method of claim 1 wherein a collection of the expressed
sequences is obtained from cells at the primary site of the host mammal.

18. The method of claim 1 wherein a collection of the expressed 
sequences is obtained from a cell line of immortalized transfected cells. 

19. The method of claim 1 wherein the transfected cells are introduced 
to the primary site by subcutaneous implantation. 

20. The method of claim 1 wherein the host mammal is a mouse, a rabbit
or a primate. 

21. The method of claim 1 wherein the host mammal is a syngeneic,
xenogeneic, immunocompromised or transgenic host mammal.

22. The method of claim 1 wherein the host mammal has reduced
expression of TGF-β.

23. The method of claim 1 wherein the primary site is the renal capsule,
the prostate or the testis. 

24. The method of claim 1 wherein the secondary site is selected from the
group of sites consisting of lung, kidney, liver, lymph nodes, brain, bone,
testis, spleen, ovaries and mammary.

25. The method of claim 1 wherein differential display PCR is performed 
with an anchor primer and a variable primer. 

26. The method of claim 25 wherein the anchor primer comprises a
polythymidine sequence and a dinucleotide sequence connected to a
3'-terminus.

27. The method of claim 26 wherein the polythymidine sequence 
comprises between about 5 to about 30 thymidines. 

28. The method of claim 26 wherein the dinucleotide sequence is selected
from the group of sequences consisting of AA, AG, AC, AT, GA, GG, GC,
GT, CA, CG, CC and CT.

29. The method of claim 25 wherein the anchor primer or the variable
primer comprises a detectable moiety selected from the group consisting of
radioactive moieties, phosphorescent moieties, magnetic moieties, 
luminescent moieties and conjugatable moieties. 

30. The method of claim 25 wherein the anchor primer and the variable
primer have a common sequence. 

31. The method of claim 1 further comprising the step of treating the host
mammal with a metastatic agent.

32. The method of claim 31 wherein the metastatic agent is a retinoid.

33. The method of claim 1 wherein identifying comprises determining the
nucleotide sequence or expression product of the metastatic sequence.

34. The method of claim 1 wherein the metastatic sequence identified is 
specifically expressed in metastatic or non-metastatic cells. 

35. A metastatic sequence identified by the method of claim 1.

36. The metastatic sequence of claim 35 which is a sequence which
encodes TGF-β1, Cyclin D1, lysyl oxidase, caveolin, actin binding protein,
ubiquitin activating enzyme E1, nmb, α-actinin 3 or homologs thereof.

37. A method for identifying a metastatic sequence comprising the steps
of:

a) pretreating a mammalian cell with a metastatic agent to form 
a population of cells predisposed to metastasis; 

b) introducing the pretreated cells to a primary site of a host 
mammal; 

c) maintaining said mammal for a period of time sufficient to
develop a metastasis at a secondary site;

d) treating cells of the primary or secondary sites with a 
genotoxic agent; 

e) amplifying expressed sequences of treated cells by
differential-display PCR; and

f) identifying the metastatic sequence. 

38. The method of claim 37 wherein the metastatic agent is an oncogenic 
sequence and the mammalian cell is treated by transfection with the 
oncogenic sequence. 

39. The method of claim 37 wherein the metastatic agent is TGF-β1,
Cyclin D1, p21, p34, p53, lysyl oxidase, caveolin, actin binding protein,
ubiquitin activating enzyme E1, nmb or α-actinin 3, and the mammalian cell
is treated by contact with the metastatic agent.

40. The method of claim 37 wherein the genotoxic agent is
benzanthracene (BA), dimethyl benzanthracene (DMBA) or 5-azacytidine.

41. The method of claim 37 wherein the metastatic agent and the 
genotoxic agent are the same. 

42. The method of claim 37 wherein the expressed sequences amplified
are compared to expressed sequences amplified from mammalian cells
before pretreatment to identify the metastatic sequence.

43. The method of claim 37 wherein the expressed sequences amplified
are compared to expressed sequences amplified from pretreated cells to 
identify the metastatic sequence. 

44. The method of claim 37 wherein the expressed sequences amplified
are compared to expressed sequences amplified from cells obtained from the
primary site or cells obtained from the secondary site.

45. A nucleic acid sequence identified by the method of claim 37. 

46. A method for identifying a metastatic sequence comprising the steps 
of: 

a) treating a mammalian cell with a metastasizing agent to form
a population of treated cells;

b) introducing treated cells to a primary site of a host mammal; 

c) maintaining said mammal for a period of time sufficient to 
develop a metastasis at a secondary site; 

d) amplifying RNA sequences of treated cells and RNA
sequences of the metastasis by differential-display PCR; and

e) comparing the amplified sequences and identifying the
metastatic sequence.

47. The method of claim 46 wherein the metastatic agent is a chemical
compound, a nucleic acid, a protein or a combination thereof.

48. The method of claim 47 wherein the chemical compound is a 
benzanthracene, dimethyl benzanthracene, or 5-azacytidine. 

49. The method of claim 47 wherein the nucleic acid contains an 
oncogenic sequence. 

50. The method of claim 47 wherein the protein is p53, myc, ras, caveolin
or TGF-β1.

51. The method of claim 46 wherein the mammalian cell is transfected 
with an oncogenic sequence before or after treatment. 

52. The method of claim 46 wherein the mammalian cell is a cell line.

53. The method of claim 46 wherein the mammalian cell is derived from
lymphatic tissue, hematopoietic cells, reproductive tissues or urogenital sinus 
tissue. 

54. The method of claim 46 wherein the mammalian cell is a fetal cell.

55. The method of claim 46 wherein the mammalian cell is derived from
a transgenic animal.

56. The method of claim 46 wherein the primary site is the renal capsule, 
the prostate or the testis. 

57. The method of claim 46 wherein the secondary site is selected from
the group of sites consisting of lung, kidney, liver, lymph nodes, brain, bone,
testis, spleen, ovaries and mammary.

58. The method of claim 46 wherein differential display PCR is 
performed using an anchor primer and a variable primer. 

59. A metastatic sequence identified by the method of claim 46. 

60. A diagnostic kit for screening a biological sample for the presence or
absence of metastasis comprising a metastatic sequence identified according 
to the method of claim 46. 

61. A method for treating a metastatic disorder comprising administering
a composition containing a therapeutically effective amount of a metastatic
sequence or the expression product of said metastatic sequence to a patient
wherein said metastatic sequence was identified according to the method of
claim 46.

62. The method of claim 61 wherein said metastatic sequence is selected
from the group consisting of TGF-β1, Cyclin D1, p21, p34, p53, lysyl
oxidase, caveolin, actin binding protein, ubiquitin activating enzyme E1,
nmb, α-actinin 3 and homologs thereof.
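
The anchor primers recited in claims 26 to 28 (a polythymidine run followed by a 3' dinucleotide whose first base is not T) can be enumerated mechanically. The Python sketch below is offered only as an illustration under stated assumptions: the function name is hypothetical, the default run of 11 thymidines is an arbitrary choice, and the 5 to 30 bound stands in for the "about 5 to about 30" range of claim 27.

```python
from itertools import product


def anchor_primers(n_thymidines=11):
    """Return the 12 anchor primers 5'-T(n)MN-3' with M in {A, G, C}.

    n_thymidines defaults to 11, which is an assumption for illustration;
    the range check mirrors the "about 5 to about 30" wording only loosely.
    """
    if not 5 <= n_thymidines <= 30:
        raise ValueError("polythymidine run expected to be 5 to 30 bases")
    poly_t = "T" * n_thymidines
    # First base of the dinucleotide excludes T so the primer anchors at
    # the junction between the transcript body and its poly(A) tail.
    return [poly_t + first + second
            for first, second in product("AGC", "AGCT")]


if __name__ == "__main__":
    for primer in anchor_primers():
        print("5'-" + primer + "-3'")
```

The twelve 3' dinucleotides produced are AA, AG, AC, AT, GA, GG, GC, GT, CA, CG, CC and CT, matching the group listed in claim 28.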

FIGURE 9

FIGURE 12AA CL-1#2

AAl'TTTTTTTTTCGACGGCCCAACGGAATTTTTTTTTTCGACGGCCCAACGGAATTTTT 
TTTTTCGACGGCCCAACGGGAATTCGGCTTAGCTAAGGTCACCCAGACTTCATGGACT 
TGTCTATTTTCTTGCCCAAAGGGATAGTTCCTCAGGTATTTGGGGACAGCATTCACCTC 
TTGCAGGAGCTATGCCTGTGTGTTTGTGCTAAGTTGATACTTTCTGCGATGATCTCAC 

(SEQ ID NO. 31)

FIGURE 12AB CL-10#3

TACCATCGGAGAAAGAAGACCAAGCAAGGCTCAGGCAGCCACCGCCTGCTTCGCACT 
GAGCCTCCTGACTCAGACTCAGAGTCCAGCACAGACGAAGAGGAATTTGGAGAATTG 
GAAATCGCrCTCGTTTTGTCAAGGGAGACTATCCCGATGCTGCAAGATCTGCTGTCCCT 
CTGGCCTTTGTCATCCTCGCGCCTGCGTTGTGGCCTCTGTCiGGCTTGGTGTGGAGCAAA 
TGGCTCTCAAGGAGGACTGAGTCTCAAGGAAATT (SEQ ID NO. 32) 

FIGURE 12AC CL-11A#5

AGCTAAGGTCAGGAGGTGTCTGAAGAATTGGCTGATGCATGGCAGGGATGTTGTTGAC 
CTGCTTTTAGAACAATACTTCCATTTAATTATAGCATATCTTATGTGTGTATTAAAGCA 
GAGCCGATCTGGTGGGGCTCATTAAGTAAATGTACTTACTGCAAAAGGTTCAACTGGT 
GACCCCAGTTTTCCCCAGAAGCAATATGA'l'AGGACAGAGGCGACTCCTGCAAGTTGTC 
TCAGACTTCACACATACATTGTGACATTCTCTGAGCATGTGCACTGTACATGATATGAC 
ACTATCAA (SEQ ID NO. 33) 

FIGURE 12AD CL-11C#2 

AGCTAAGGTCCACTACCTTGTGAAGATGTATAAACACCTGAAATGTAGAAGCGATCCG 
TATGTCAAGATCGAGGGGAAGGACGCTGACGACTGGCTGTGTGTGGACTTTGGGAGTA 
TGGTGATCCATTTGATGCTTCCAGAAACCAGAG.AAACCTATGAATTAGAGAAACTATG 
GAC'rCTACGTTCTTTTGATGACCTTAGCTAAGCCGAATCAGCACACTGGCGGCGTTACT 
AGTGGATCGAGCTCGTACAGCTGATGCATAGCTTGAGTATCTATAGGTTACTAATAGC 
TGGCTATCATGTCAAGCGTTC (SEQ ID NO. 34) 



FIGURE 12AE CL-12#1

AGCTAAGGTCAAAATAAAAGCTCAAGATGACATCAGTCCCATTTGTCCTAAGTCCTGG
TGTTGTATGGATGGTAAGCAGCAGCCAATTATGGTGACAGGTGATAGATCCAATTTGT 
TAACATTTCTCCATCTCTAAGCCATCCTTAAAGAAAATCATGAATGGAGTCACACCAT 
CTTCACGGTAGTCCAGGAGAGCAACCATACCATCTGGATTCATGTTTCACCAATAAAA 
ACTGGTAGTTATTGAATTAGCAAGGATGTGCTACTCTCTGCAGCTCAGC 

(SEQ ID NO. 35) 

FIGURE 12AF CL-13#1

AGCTAAGGl'CTCATGCAATGGAACTTAATTCTTAGAACTGTAAGAATTACATCAAACA 
TAAAAGCCTCCCTATTAATGTAGTCCACAAAACTGGCAGGTATATATGCCTTCTGAAT 
TTGTCTCCAGTGACTTTGGTAAATCTAACTAAATTTTTAAAAATTCTTAATGAATTTAT 
CGTCAACAACAACCACCTCTTGGAAAATTAACCCTTGCACiTGTCTGTGTTAGACTCAG 
AAGTCAA (SEQ ID NO. 36) 

FIGURE 12AG CL-14#4

GAATTCGGCTTAGCTAAGGTCAGCGTGAAGTTTAAGCAGACATGAGTCTGAAACAGTC 
TCATGACACATCTGATAGGATTTTTTAAGACTGCCTGGCTTAGTCTTACTGCTGTTAGT 
GTATATTAGGTGTTGTACACATTATAAAGAAAATTATGTCTCATTATCTTGTTTAAGTC 
AAGGAAAATAGAGAACTTTGGTCAAAT (SEQ ID NO. 37) 

FIGURE 12AH CL-2#2

GAATTCGGCTTAGCTAAGGTCAGCGTGAAGTTTAAGCAGACATGAGTCTGAAACAGTC 
TCATGACACATCTGATAGGATTTTTTAAGACTGCCTGGCTTAGTCTTACTGCTGTTAGT 
GTATATTAGGTGTTGTACACATTATAAAGAAAATTATGTCTCATTATCTTGTTTAAGTC 
AAGGAAAATAGAGAACTT 

(SEQ ID NO. 38)

FIGURE 12AI CL-2#3

GAATTCGGCTTAGCTAAGGTCAAAATACACGGATTGCAATCACTTTTCTAAACAAAAG 
AAACAAAGTAACTGCTGAGGTTAGCAAAGATGAGTTCTCGTCATACTGCCTTGTACTG 
TTTTGTGAACTGTGTTATTAAAAATCTGAGCTTAACAAAATCTTTACAAGTCACCTCAT
GAAAACAGCATTTGGCCAATAAGAGTTTAATTCCACACCAGTGAGACCTTAGCCT 

(SEQ ID NO. 39) 

FIGURE 12AJ CL-2#4 

GAATTCGGCTTTCTGCGATCCACTCTTTGAAGCTATTGGCAAGATATTCAGCAACATCC
GCATCAGCACGCAGAAAGAGATATGAGGGACATTTCAAGGATGAAAGGTTTTTTTCCC
CCCTTACTATTTCCTTGGTGCCAATTCCAAGTTGCTCTCGCAGCAGCAAATTTATGAAT 
GGTTTGTCTTGATCAAGAACAAAGAATTCATTCCCACCATTCTCATATATACTACTTTC 
TCTTCTT (SEQ ID NO. 40) 

FIGURE 12AK CL-3#1 

GAATTCGGCTTTCTGCGATCCACTCTTTGAAGCTATTGGCAAGATATTCAGCAACATCC 
GCATCAGCACGCAGAAAGAGATATGAGGGACATTTCAAGGATGAAAGGTTTTTTTCCC 
CCCTTACTATTTCCTTGGTGCCAATTCCAAGTTGCTCTCGCAGCAGCAAATTTATGAAT 
GGTTTGTCTTGATCAAGAACAAAGAATTCATTCCACCATTCTCATATATCTACGTCTCT 
TCTAG (SEQ ID NO. 41) 

FIGURE 12AL CL-4#1

GAATTCGGCTTTCTGCGATCCTAGAGCAGGTAAGTGAAGAAGGCCAGTAAGTTTTAAG 
GATGGCCTTGTTGCCTTCTATCAAGTTC rCTGGGACTTTGTAATTTTGATTACTACTATT 
GATACATGGTTATGGTCAGAAGGCCTCTTCTCCCTT (SEQ ID NO. 42) 

FIGURE 12AM CL-4#2 

AGCTAAGGTCCGGACTCTATGGCATGACCCCAAAAACATTGGCTGGAAAGATTACACT 

GCCTACAGGTGGCACCTGATTCACAGGCCTAAGACAGGCTACATGAGAGTCTTAGTGC 

ATGAAGGAAAGCAAGTCATGGCTGACTCAGGACCAATTTATGACCAAACCTACGCTC. 

GTGGACGGCTGGGCTGTTTGTCTTCTCCAAGAGATGGTCTATTCTCGGACCTCAAGTAT 

GAGTGCAGAGATGCTAGAGAGCAGGCTCAGTCTCAGCA 

(SEQ ID NO. 43)

FIGURE 12AN CL-5A#4

TGACCATCCiAGTGCATCAGCCTCATCGCrGCTGGCCGTCGGGAAGGAGAAATTCATGCA 
GGATGCTTCAGATGTGATGCAGCTATTGTTGAAGACACAGACAGACTTCAATGATATG 
GAAGATGACGACCCCCAGATTTCTTACATGATCTCAGCATGGGCCAGGATGTGCAAAA 
TCTTGGGAAAGAATTCCAGCAGTACCTTCCCGTGGTTATGGGGCCGCTGATGAAGACT 
GCTTCAATTAAG TCCTGAGTGCCTCTAGACACCAGGACATGAGA TATGAGGTA 

(SEQ ID NO. 44) 

FIGURE 12AO CL-6#2

TGACCATCGTGTAGTTGGTGTGCrrGTTGTCGAAGATGAGGGCCTCCTGGATGAGCTG 
GTGCTGCTGCTCCAGCAGGTCCAGGCTGGGCTTGTAGTCCACGATGCTGCGCTCGTAC 
TGCTTCAGGTGGCTCAGCTGGTCTTCCAGAGTCCCGTTCATCTCAATGGAGATGCGCCC 
GATCTCCTCCATCTTAGTCTGGATCCACGGCCCCACCATATTGGCTTGGCTGGCGAACT 
GTCGGCGAAGGCTGCATTGGATTGCT 

(SEQ ID NO. 45) 

FIGURE 12AP CL-7#4 

TGACCATCGAACACCCCAACACTCTCCACTACCTGCCATTTCTTCCAGCCTTATCCACA 
CCACCCCGTTTCTCCTGAAGACTGATTTGCTTAGCAACTGCACTGAGCCAACCCTGAA 
GACACATC5ATTATTGGTTGGGCTCCATTAAACAACAAGCCTAGTGCTTGGGAAGGGGG 
GTGGGGAGGGGAAGAGACGTGAGAAGCATGTTGGCGTAGACCTTGAGGCATGGATGA 
AGCATCTGCCGGCCTGACCTGGTACAGGTGGCATCTGCACTGCAGCAAGGC 

(SEQ ID NO. 46)

FIGURE 12AQ CL-8#2

TGACCATCGAAGTGCAAAGGAAATGACTTGATTTCATGAAGTATCTCCAGAAGTAACG
CTTTGTTTTCTGCATCCTGAACTTTATTCCCAGTGAAGAGCTGAAAATCTGGACGCTCA
AAAAATGGAAGCACTTTGGAGAGAGCCCTTAACTCTATCAGGTACAGGAAGTACAAG 
TTCCTCAGCCTTCGTGGGCCTTCTCCTTCAGTCAGAATCCATCAAAGGTGCTGGAACTC 
TGTGACATTGTGACCCATTCTTTCAGCCAGTATCTGTAAGATAC 

(SEQ ID NO. 47) 

FIGURE 12AR CL-9#1

GGGAACGAATGATCTGGAACTGTGGCTTGTAGACAACCCAAATATCTTAGGTAGGTAA 
GAAATTCCAGCATCACACTATATAGGAAATACTGTGCGAAACTGACAGTTAACTGTGC 
ACAAAGTTCAATGGCTTCAAAATAATGTATAAAGGATAAGAAGAAACCAGTTTACCAT 
TTTGGTATTATTTTGGTTGCTTTGTATAACTTCAATAATTT (SEQ ID NO. 48)

FIGURE 12AS CL-54A#2.-SP 

GGGAACGAATGATCTGGAACTGTGGCTTGTAGACAACCCAAATATCTTAGGTAGGTAA 
GAAATTCCAGCATCACACTATATAGGAAATACTGTGCGAAACTGACAGTTAACTGTGC
ACAAAGTTCAATGGCTTCAAAATAATGTATAAAGGATAAGAAGAAACCAGTTTACCAT 
TTTGGTATTATTTTGGTTGCTTTGTATAACTTCAATAATTT (SEQ ID NO. 49) 

FIGURE 12AT CL-54A#2.-S0

GACGTAAGCC (SEQ ID NO. 50) 

FIGURE 12AU 

CCACAAAGCAAGCTTCTGTCTGGAGTACAGCTCCTGTGACTATGGGTACCACAGGGCC 
TTTGCGTGCACTGCACACACACAGGGATTGAGTCCTGGATGTTATGACACCTATGCGG 
CAGACATAGACTGCCAGTGGATTGATATTACAGATGTACAACCTGGAAACTACATTCT 
A AAGGTC AG TGTAAA 

(SEQ ID NO. 51) 



FIGURE 12AV

CTATCAATGAAGGGGGAGATCACTGGGTAAGTTCGAATGCCCTCAGGCAAGGTGC'iCC
CAGCCTTCCATTACTGAATTCAAAGATC.GCACTGTTACTGTACGTTACTCACCCAGTGA 
AGCTGGCCTGCATGAAATGGACATTCGCTATGACAATATGCATATCCCAGGAAGCCCT 
CTGCAGTTCTATGTTGATTATGTCAACTGTGGCCACATCACTGCTTATGGTCC 

(SEQ ID NO. 52) 

FIGURE 12AW 

TTAGCACCTCGACCACGAAATGAGGAAGATGCAACAGACGTGGTGGGCCTGGCTCAG 

GCTGTAAACGCTCGGTCCCCACCTTCAGTAAAACAGAACAGCTTGGATGAAGACCTTA 

TTCGGAAGCTAGCTTATGTTGCTGCTGGGGACCTGGCACCCATAAATGCTTTCATTGG 

G GG C C TT G C T G C C C A GG AA G TC AT G A A G G C C TO C T C TO G AAA G TTTA TG C CC ATC ATG 

CAGTCiGTTGTACTTTGATGCTCTTGAATCiTCTCCCAGAACGGACAAAGAGGC'rCTGAC: 

AGAGGAGAGTGCCTCCCACGTCAGAACCGTTACGATGGGCAGGTAGCTGTATTGGTCA 

GACTTCAGGAGAAGCTGAGAAGCAAA (SEQ ID NO. 53) 

FIGURE 12AX 

TTAGCACCrCCAATGGCTGGGTACCAGCCAGCCGCAATGTCCGCTCCACAAATTTGGA 
GTCTGTGAGGTACTGATTAACATTTTCTGCTGGCTGCTTGAAAAGGCCTTCAAATTCAT 
CCCGGGCCCACTGAAGAGTGTGTTCGATGGCATTGGGAAAGTTTTTCAGGGTACAAAT 
GGGGATGGATTTCTCTGGTGGATCCTGGCTAGACGTGATGGATTCTGTCAGGAAGGGG 
ATTACCACCTGCACGTTGCCCTTT 

(SEQ ID NO. 54) 

FIGURE 12AY

TTAGCACCTCACACTCACATGCCCTTCTACATAGAGACTGGTTAAACAGCCCTCCCTCC 

CTTGTCCCGACTTGACTTCCAGGCCCCTCTGCTTTCCTCTCACAACCACACCAGGTCTC'i 

ATGGAGTCCAGTGCCTGCAGTGACCCAACATAGACTGCACTTTCACCTACCTAC'rGGA 

TGGTCCTGCAGCCCAGACGGCTGCTCTTCTTTCTCATGGAGTTTCTCTCCTGCCTGAGA 

TATGCTATCTGGTCTGCCCCTGTGTAGCTCCCATGGGATCCCTTAAAATCGATCCTTT'r 

TTAA (SEQ ID NO. 55)

FIGURE 12AZ

TTAGCACCTCOTGAGGAGACTGTTGTCCACAGGCCAGCTAGTGGTACCCTACTGAGAA 

GTTGGGTTTTGGTTTTGTTTCCCTTGAAGGGTCGCTGTTAGAGGATGGAAGTAACTTCT 

AATTCTTGATCTGTTTGTTGGTCTTGTTTTCAGTACTTTTTGCCAGTTGTATACACTTGG 

AGAGGGAATTTGTATGCCTGTAATCTTGTTCTTGAGGTCAGAAATTCAAAACATTGGG 

AGCTTTTGTTGTAAAGGTTAAACTGTGAATCCATATAGCAAATGCAGATCCTTTTACA 

GTGTAAACCACATTTCCTGCCTCAGCCTAAAGCACTGGTCATTT (SEQ ID NO. 56) 

FIGURE 12BA 

ACCTGCATGCCTAAAGGAGTAGGCTTAGGGGTGGGGAGAGAGAAGGCATAGGCTTTT 

CTAGTTATACAAAGCTGTGTAAGGCAAGGTTCCTTTCTACTAAATGGTCAGCTGTCACT 

ACATTTATACTTTTGTATGTCATAAACCCTTTCTTTCA'rTCCTCCCTGGGTAACCAGGA 

CAATCGGAGGGCAGTGTGTTACTGGGATTAGAGGACTAGCAATACTGGGTAACCCGCC 

TAAGCTGGAAGGTGACGTAATACGTTTCTTTAAAGATTCAGTCAGTCAAGCAGTTTAG 

CAATATCAAAATGTCTGGCTGTTTGGTCCAGTGTACACTGTT (SEQ ID NO. 57) 



FIGURE 12BB 

GCTATCTGCGAAACTACAGAAAGGAAGACAGCTTGGCCCAGCGCGGTGAAGTTCAGA 

ATTCACTAGGTAGTTGTTGTTGGTTGACTTGGAGGTAGCTGGGTAATCAACAGCTTTCA

CTTTAGATTCAATGTGAACCGCAGAGTTACTCATGACCAAGAGTCTGGCAAACTCATT 

AATGCTGTTTAATACTTGTTTGATATTTTTTCACCTTTTGAGCCCTTTTCCCAAAGAATT 

CAATATCAGTTTAGTAGCAACAGTACAGTTGCCATTTAAATTGGTTTAGTTGCAGTATA 

GCA (SEQ ID NO. 58)

FIGURE 12BC 

GCTATCTGCGAAACTACAGAAAGGAAGACAGCTTGGCCCAGCGCGGTGAAGTTCAGA 
ATTCACTAGGTAGTTGTTGTTGGTTGACTTGGAGGTAGCTGGGTAATCAACAGCTTTCA
CTTTAGATTCAATGTGAACCGCAGAGTTACTCATGACCAAGAGTCTGGCAAACTCATT 
AATGCTGTTTAATACTTGTTTGATATTTTTTCACCTTTTGAGCCCTTTTCCCAAAGAATT
CAATATCAGTTTAGTAGCAACAGTACAGTTGCCATTTAAATTGGTTTAGTTGCAGTATA
GCA (SEQ ID NO. 59)

FIGURE 12BD

GCTATACTGCAACTAAACCAATTTAAATCtGCAACTGTACTGTTGCTACTAAACTGATA 
TTCjAATTCTTTGGGAAAAGGGCTCAAAAGGTGAAAAAATATCAAACAAGTATTAAAC 
AGCAl'TAA'rGAGTTTGCCAGACTCTTCiGTCATGAGTAACTCTGCGGTTCACATTGAATC 
TAAAGTCtAAAGCTGTTGATTACCCAGCTACCTCCAAGTCAACCAACAACAACTACCTA 

gtgaattctgaacttcaccgcgctgggccaagctgtcttcc 

(SEQ ID NO. 60)

FIGURE 12BE 

gctatactgcccaccacal'tgccacactcggaatgacatttctatattttcacctcccc 
agatttccatttcttcatcgtaacttccaatgtgctcaaaataitttttagatatagaa 
aaaaggcctcctgcaaaggtgggggtcttaattgggtaggtttcatctttccttctttg 
cttctcatgatcaggaagtgactcccagccaaaggaaaggctccagtcaaaatttcca 
cggttatggttgcttccgtacggagaaggcttgttgaattcaaatgtgtttagatctat 
ggatgcgatgtctggactcaccacggca 

(SEQ ID NO. 61)

FIGURE 12BF 

GCTATACTGCTGAAGGAGATCATTTTGGTGGATGATGCTAGTGTAGACGACTACCTGC 
ATGAAAAGCTGGAGGAATACATAAAACAGTTTTCTATTGTGAAAATAGTCAGGCAGC 
AAGAAAGGAAAGGCCTGATCACCGCGCGGTTGCTAGGGGCAGCTGTAGCAACTGCCG 
AGACGCTCACGTI'CTTAGATGCTCACTGTGAGTGC'I'TCTA'rGGCTGGCTGGAACCTCl'G 
CTGGCCAGGATAGCTGAGAACTACACTGCCG (SEQ ID NO. 62) 

FIGURE 12BG

AGTTGC:CAGGGGGCAGCTCACGGCGCAGCTCATCCTCTGTGATGTAATTCTTATCTCC 
AGCCAGGATCTTGAAGGAAGCCATCiACCTGA'rCTGCAGTATCAGTATCTGCCGTCTCT 
CGGGACATAAAGTCGATG/VAGGCCTGGAACGTCACTACCCCCAAGCGGT'rGGCKiTC'r
ACAATGCTCATGATTCGGGCAAACTCTGCCTCTCCCATGTTGTAACCCATCKiAGATAA 
GGCAGGCGCGGAAATCGTCTGTGTCCATCATGCCCGTCTTCTTCCGGTCAAAGTGGTT 
GAAAGA (SEQ ID NO. 63) 

FIGURE 12BH 

AAGCCGTGTCGCTGAACTGGGAGGACACACTGCTCACCCTAGAAGGCTCTGGCTGACC 

CTCCGCCCGGTTAAACAGGGACTTTGTGGCCATGTGCTGGCGACACAGGTCCTGGTAC 

TCAAAAGTAGTGTCACCATGGGCCCCCTCCGGCCCCACiCCiCTGCCAGGCGTCCTTATC 

CCGCTGTCTCGAATGATGGCGCATACCAAGGCCACTGAAAGCCACTAGCAGCCCAGCG 

ACGCCTGCCAGGGCCACTAGAGTAAGCAGCACTGAGCGCATGGGAGATATGCCAT 

(SEQ ID NO. 64) 

FIGURE 12BI

AAGCCGTGTCTGGACGTCCGTGTGTCCGGCTCTTGC'i'CACGCAGTCATGGCCTCCGGA 

ACGCGCAAATCGGAAAGTCGGCTCCTGACTTCACGGCCACAGCGGTGGTGGATGGTGC 

CTTCAAGGAAATCAAGCTTTCGGACTACAGAGGGAAGTACGTTGTCCTCTTTTTCTACC 

CACTGGACTTCACTTTTGTTTGCCCCACGGAGATCATCGCTTTTAGCGACCATGCTGAG 

GACTTCCGAAAGC'rAGGCTGCGAGGTGCTGGGAGTGTCTGTGGACTCTCAGTTCACCC 

ACCTGGCGTGGATCAATACCCCACGGAAAGAGGGAGGCTT (SEQ ID NO. 65) 

FIGURE 12BJ 

AAGCCGTGTCGGAGGGCACCAAGGCTGTCACCAAGTACACCAGCTCCAAGTGAGTGC 
TCAAGACTCAGCTCTTAACCCAAAGGCTCTTTTCAGAGCCACTCAAGACTTCAAAATT 
GGAGCTTTAATGCTCiACTTAGTGACTACCGGGAAAATAACTGAC'JTCATCTGCAGGAT 
TGTGTACAAACACTTATGGTTTAGTAAATCGAAAAGATAGACATTGCCCATCAGTTCT 
GTCTGGTCCACTTAAATATGCTTTTTTCTTAGAAGTTCTAAGAACCCTGTCAATAACCT 
ATCTAGGTCCAGTCCTTGAGTTCAAAGGCCAAATACCAATG (SEQ ID NO. 66)



FIGURE 12BK

CAACGCTCAGGATGTAAGCTGTTTCCAGCACCTGGTTCAAGCGAATG'rAAOAAATAAO

AAGGTGTTGAAAGATGCCGTGAATAACATTACAGCAAAGGGGA'J'CACAGATTACAACi 

AAAGGCTTTAGCTTTGCCTTCGAACAGCTACTTAATTATAATGTTTCCAGAGCTAATTG 

CAATAAGATTATCATGTTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATT 

TGCCAAATACAATAAAGACAAAAAAGTCCGTGTGTTTACATTTTCCGTCGGTCAACAT 

Ay\TTATGACAGAGGACCTATTCAGTGGATGGCTTGTGAAATAAAGGTTACTATTATGA 

GATTCCTCCATT (SEQ ID NO. 67) 



FIGURE 12BL 

TCAACGCTCATCACACCAAGAATCAACTGGTTCTTCAAGTTTGTCTTATTTTCAGATTG 
GCCAGTGACGTTGAAGACTGGTAGAGTTCCAGTAATGACAAGTCCCAGTTCCAGGGCA 
TCCAAATACACATTTGTCCATTGAACTTGCTTCGCTTTGTCACCAGCTAAAACCATTGG 
TCTTCCCAGAACATCTAGATATTCCTGAGTATTGATTCTTATTGCACCAATCiGAGGGAA 
TCTCATAATAGTAACCTITATTTTCACAAGCCATCCACTGAATAGGTCTCTGTCATAAT 
TATGTTGACCGACGGAAATGTAA 

(SEQ ID NO. 68)

FIGURE 12BM 

TAACGCTCAGGAGAAGAATAGGAATGCAGAGAACTCTGCCACAGCCCCCACGCTCCC 

GGGCAGCACCTCAGCCACCACCGCAACCACCACCCCTGCTGTAGATGAAAGCAAGCCT 

TGGAACCAGTATCGCTTGCCTAAGACTCTTATACCTGACTCCTACCGGGTGATCTTGAG 

ACCCTACCTCACCCCCAACAATCAGGGCCTGTACATCTTCCAAGGCAACAGTACTGTT 

CGCTTTACCTGCAACCAGACCACGGATGTCATTATCATCCACAGCAAAAAGCTCAACT 

ACACCCTCAAAGGAAACCACAGGGTGG 

(SEQ ID NO. 69) 

FIGURE 12BN 

CGAGTCAGACGGCTTCAGCATCGAGACCTGTAAGATCATGGTGGACATGCTGGATGAA 
GATGGGAGTGGCAAGCTTGGCCTGAAGGAGTTCTACATCCTCTGGACGAAGATTCAGA 
AATACC.AAAAAATCTACCGGGAAATCGATGTCiGACACiGTCTGGAACTA'rGAATTCCTA 
CGAGATGCGCfAAAGCACTGGAAGAAGCAGGTTTCAAGCTGCCCTGTCAACTCCATCA
AGTCATCGTTGCCCGGTTTGCAGACGACGAGCTAATCATCGACTTTGACAATTTTG 

(SEQ ID NO. 70) 

FIGURE 12BO 

CGAGTCAGACAACCTGTTCAAGTGGGGTGGGGACCATCCACGGAGCAGCCGGCACCG 

TATATGAAGACCTGACxGTACAAACTCTCCCTAGAGTTCCCCAGCGGCTACCCTTACAA 

CGCACCCACAGTGAAGTTCCTCACACCCTGCTACCACCCCAACGTGGACACCCAGGGC 

AACATCTGCCTGGACATCCTCAAGGATAAGTGGTCTGCACTATATGATGTCAGGACTA 

TCTTGCTCTCTATCCAGAGCCTGCTAGGAGAACCCAACATCGATAGCCTTTGAACACA 

C ACGCTGCGGAACTCTGGAAAA (SEQ ID NO. 71)

FIGURE 12BP 

TATGAGTCCGGAGCGACGGCTACGAGTGTGAACTGTTCCAGCCCCGAGCGACACACCA 

GAAGTTATGACTACATGGAAGGAGGGGATATAAGGGTGAGAAGACTGTTCTGTCGCA 

CCCAGTGGTACCTGAGGATTGACAAACGAGGCAAAGTGAAAGGGACCCAGGAGATGA 

AGAACAGCTACAACATCATGGAAATCAGGACCGTGGCAGTTGGAATTGTGGCAATCA 

AAGGGGTGGAAAGTGAATACTATCTTGCCATGAACAAGGAAGGGAAACTCTATGCAA 

AGAAAGAATGCAATGAGGATTGCAACTTCAAAGAACTGATTCTGGAAAACCATTATA 

ACACCTATG (SEQ ID NO. 72) 

FIGURE 12BQ 

TATGAGTCCGAGGAGGAGCACAATGCTGGGAGTGTGGAAAGCCAGGTTGTCCCCAGC 

ACACACCGAGTGACCGATTCCAAGTTCCATCCACTCCATGCCAAGATGGATGTCATCA 

AAAAAGGCCACGCCAGGGACAGCCAGCGCTACAAAGTTGACTATGAGTCTCAAAGCA 

CAGACACCCAGAACTTCTCCTCCGAGTCTAAGCGGGAGACAGAATACGGTCCCTGCCG

CAGAGAAATGGAGGACACACTGAATCATCTGAAGTTCCTCAATGTGCTGAGTCCAGAG

TCTCACATCCAAACTGTGACAAGAAGGGG 

(SEQ ID NO. 73) 



FIGURE 12BR

TCGCCCGGGACTTCATGCGATTGAGAAGATTGTCTACCAAATATAGAACAGAAAAGAT

TTATCCCACAGCCACTGGAGAAAAAGAAGAAAATGTTAAAAAGAACAGATATAAGGA 

CATACTGCCATTTGATCACAGCCGAGTTAAGTTGACTTTGAAGACTCCATCCCAAGAT 

TCAGATTATATCAATGCAAATTTTATTAAGGGTGTGTATGGGCCAAAAGCATATGTGG 

CAACCCAAGGGCCTTT 

(SEQ ID NO. 74)

FIGURE 12BS

TGTGGAAAGCCAGGTTGTCCCCAGCACACACCGAGTGACCGATTCCAAGTTCCATCCA 
CTCCATGCCAAGATGGATGTCATCAAAAAAGGCCACGCCAGGGACAGCCAGCGCTAC 
AAAGTTGACTATGAGTCTCAAAGCACAGACACCCAGAACTTCTCCTCCGAGTCTAAGC
GGGAGACAGAATACGGTCCCTGCCGCAGAGAAATGGAGGACACACTGAATCATCTGA
AGTTCCTCAATGTGCTGAGTCCAGAG

(SEQ ID NO. 75)

FIGURE 12BU 

TGACCATCGAAGTGCAAAGGAAATGACTTGATTTCATGAAGTATCTCCAGAAGTAACG 

CTTTGTTTTCTGCATCCTGAACTTTATTCCCAGTGAAGAGCTGAAAATCTGGACGCTCA 

AAAAATGGAAGCACTTTGGAGAGAGCCCTTAACTCTATCAGGTACAGGAAGTACAAG 

TTCCTCAGCCTTCGTGGGCCTTCTCCTTCAGTCAGAATCCCATCAAAGCGCTGCTGGAA 

CTCTGTGACATTGTGACCCCATTTCTTTTCCAGCCAAGTATCTTGTAAAAGATACCTTG 

CACTCAAATGCACATTAATGCTTGCGTGCAGGCCAGATATAAGTCTGTAGAATCGCTC 

TT'l'CTACACAGAGGCCTTCTAGCCAGTTGTAAA 

(SEQ ID NO. 76) 

FIGURE 12BV

CTGCTTGATGCTAAGCCCGGCAGCCTGTGTTTCATCTACAGGATCiCACAACATAAAAG 

AAA^GATCTGATTCCCGCAGGTTCTCTTCTGACCTACACACACACACACTAAAATAAC 

ATTTAAAAATATGTGCCAAATTATATTTGTTCGGGTGCCACCTTCCACCAGCTTACCAC 

TACGGTAGAACTGTCAAATTCATCTCCCTGAA'ITTGTCTTAAAGGGGI'GTCCATGCAC 

AGGCCCAAGAGTCACCTCCAATGAAATAAATGTA^TACTGAAGTATGCCATGATGTTT 
gttgttttctttcatcgtaagcctgtaacicaggaaaaatagi'aatagatagaatagag
acttaccagtggtcgatggcctggtcagtct(itc;cck;t(;actaggaccagg 

(SEQ ID NO. 77)

FIGURE 12BW 

ACCTGCATGCCGAGTGTGACGCCTTTGAGGAGAAGATCCACiGCTGCCGGAGGGATCG 

AACTCTTTGTCGGAGGCATTGGCCX'CGATGCiACACATTCiCCTTCAATGAGCCAGGCTC 

CAGLX'TGGTGTCCAGGACCCGTGTGAAGACTCTGGTTATGGACACCATCCTGGCCAAC 

GCTAGGTTCTTTGATGGTGATCTTGCCAAGGTGCCCACCATGGCCCTGACAGTGGGTG 

TCGGCACTGTCATGGATGCTAAAGAGGTGATCjATCCTCATCACAGGCGCTCACAAGGC 

C TT T G C TC T G T A C AAAG C C A TC G A T G G A GG C G T G AA C C A C A TG T G G A CGG TGTG 

(SEQ ID NO. 78) 

FIGURE 12BX 

GCTATACTGCAATGTTAGGGGAATGAACGCGTTTTCCTACTGCACTGGGGACTTTTAG 

ATAGGTTAATGAAAGGCCTTTTATTCTGTTACTGGACACGAAAAC1TTGTCTAATTTCT 

TATACTCTATTGTACGTTTACAGTCGCAGCAC'I'AAAATGGAAGACATCAAACATTTTT 

AACAGAAAAAAAAAAAGATGTAAAAACTAACTAAGGACTA'ITTATTGATAATGTm 

CTACTCCTGTCAGACAATGGCTATAAAC'I'GAATTAGGCAGTCTTAAAAAAAAAAAAAG 

AAAAAAAAGAAAAAAGAAAA7\AAGAAAAGAAAAGAAAAAAA^ 

(SEQ ID NO. 79) 

FIGURE 12BY 

AGC'rAAGC/rCGGGIACTC'rGATACTTCAGAGTTTAAAATCATCAGCCCTTGTAGATCT 
ATTCXTAAATCTTATGAAAATGCTCAGATGTTTACACAGCTGTGAAACAGGGTCAGTT 
CAGATCGCTGA rGGCTTGAGAATGTGTTTCTI'GTTGACATCAGGAACTGGAAATGTTT 
ACTTCCCGTCATT'rATGAGTCATCAAGTATCTCGGCTCTTTTAAGAGCGCAAGATAAA 
ACAAGCTTAAACCAGGTGATAAGAGCAGAGTCCACTTGAGTCTGAGCTCACCCGAGA 
ACTTGCTATCGAGGACATTTGGAATGGGAGTGTGCAGGCTTCCTTCAGTTACTGAATG 
AGTCCATCTGCTAGTCACCTTGAC 

(SEQ ID NO. 80)

FIGURE 12BZ

ACiCTAAOGTCCAGC.GGCCAAAGCGGTGACGTGTOCACATCGATATGAGAAACGGCAG 

CACGTCAACACGAAGCAGGAGTCGCGGGATATCTTTGGAAGATGTTATGTCCTAAGTC 

AGAATCTCAGAATTGAAGATGATATGGACGGAGGAGACTGGAGTTTCTGCGATGGCC 

GGTTGAGAGGCCATGAAAAGTTTGGCTCCTGTCAGCAAGGAGTAGCGGCTACTTTCAC 

TAAGGACTTTCATTACATTGTTTTTGGAGCCCCAGGGACTTACAACTGGAAAGGGATC 

GTCGTGTAGAACAAAAGAATAACACTTTTTT 

(SEQ ID NO. 81)

FIGURE 12CA 

AAGCCGTGTCTGTGCTCAAGGAAGAAACCCACTGGACCAACTTCTGTCAGAAAGGAA 

AACCTTGTTCAAAGTTTCAGGACCCTGTTCT'ITGCTTATTTGCACATGGTCACCTTGGT 

CTGAGCTAGCCACCATTGTCACCCACAGCTGCAAAGAAAGCAGACCTTAGGAAACACT 

GTCACGGCTGAGTGTGACTGCCTTGTTCATCCCCTGGACTGGTACTG'I'GTTGCCTGCAG 

TACCATTGGGATCCCATAGCAAGAGAGGGAGAGGGAGATGTTAGTTAGCCTTTGCTAC 

GAACCAAGCTGTCCCAAGTCTCAACAGCTAAACAGGTATTCAT1TACCATGATTCTAT 

GGTTAGCTAAGCTCTTGAG 

(SEQ ID NO. 82) 

FIGURE 12CB 

CTTTCTACCCTGGAGGATGTGCTTGAGGCACACTGCTCCTGTGCTCTCCACTTGAGGCA 

TAAGCCCAGTCAGTTGTGCATAGATGATTAACCTCTGACCCCTAAAGATGGTAAGTTG 

CTCTGGAGAAAGCATTTTAACAGACAAACCAGGAGGCAAATCCCAACTTAGAGAGAT 

GTTATCCACTGCACACTGTAGAGCAAACTTGAGAGACCCAAGAGCCTTGGTCTGCATC 

CTGTCCTTGCCTGTGATAAACACTCGAGTACCCCCTGATACCGGGCGATATTTTTGATT 

AACTGGTCGAGGCTCCTTGTCCAATTCCAAAAGAGAACATCTGTGTTTC 

(SEQ ID NO. 83) 



FIGURE 12CC

TGGTAAAGGGCATCTGTAAATACACTCTATGAGGAAATTAAAACTTGAACATGGCAGT

CTGACATTGCAAAACAAA.ACAAAACAAAACTGACCCTCCAAl'AGCAGCGAAAACAAC 

GTGAAAGATACAAAGCAATGAGAATCTGGTTCTGAACGCCTGGC.ATCCTGGGAGTCAT 

CGGTAGCAGCGCCATGAGAGGAGCCGTGGCCTGTCCCA'rGTGGTCCCACCTTCACCTC 

TTCCCTCACATCCCTCTTAAG 

(SEQ ID NO. 84)

FIGURE 12CD

TGGTAAAGGGGGCAAGGGCAAAGGCACGGGAGACAGAGGCCACTGCATCTGTACCCA 
CATCAGACATGTTTGTCCATTTTCTCTCATTTGGCCTTAGACCATTGGCAAGAGTAAAT 
GCTCTTAGTCCCGTTATCTAGAAATTTCTTCCTTTGGGGAGAACCACTTATAGACAATA 
TCAGCTCTCTACAAATAACACGAAAGGTCGTAACAC 

AGCAAGTGACCAGAAAGTGCCCGTCCTTGCGGCTCTGATCCACGTGGCTCTCCGTAGA 
CAAATTGITTTTTC'ITGTAGGGATATCTGTTTTGCTTCTGAACTTTCTTACAAGTGTTTG 
GGACTCrrCGGGTGGCGTT 

(SEQ ID NO. 85) 

FIGURE 12CE 

TGGTAAAGGGTCAAGTGTTCGATCAGAGTGGAGCTCCATTACCGAATGTAATCGTGGA 

AGTCCAAGACAGAAAGCATATCTGCCCGTTTAGAACCAACAAGCTTGGAGAATACTAT 

CTGCTTCTGCTGCCCGGGTCCTACGTGATCAATGTTACAGTCCCTGGACACGACTCCTA 

CCTCACGAAGCTTACTATTCCAGGGAAATCCCAGCCCTTCAGTGCTCTTAAAAAGGAT 

TTTCACCTCCCGCTGCGATGGCAGCCGGATrCCATCTCCGTATCCAATCCTTCGTGCCG 

ATGATTCCGCTGTACAAATTCATGCCAAGCCACTCGGCTGCCACAAAGCCTAGTCTGG 

G 

(SEQ ID NO. 86) 

FIGURE 12CF

GAATTCGGCTTTCTGCGATCCACTCTTTGAAGCTATTGGCAAGATATTCAGCAACATCC
GCATCAGCACGCAGAAAGAGATATGAGGGACATTTCAAGGATGAAAGGTTTTTTTCCC
CCCTTACTATTTCCTTGGTGCCAATTCCAAGTTGCTCTCGCAGCAGCAAATTTATGAAT
GGTTTGTCTTGATCAAGAACAAAGAATTCATTCCCACCATTCTCATATATACTACTTTC
TCTTCTT

(SEQ ID NO. 87) 

FIGURE 12CG 

GAATTCGGCTTTCTGCGATCCACTCTTTGAAGCTATTGGCAAGATATTCAGCAACATCC
GCATCAGCACGCAGAAAGAGATATGAGGGACATTTCAAGGATGAAAGGTTTTTTTCCC
CCCTTACTATTTCCTTGGTGCCAATTCCAAGTTGCTCTCGCAGCAGCAAATTTATGAAT
GGTTTGTCTTGATCAAGAACAAAGAATTCATTCCACCATTCTCATATATCTACGTCTCT 
TCTAG 

(SEQ ID NO. 88) 

FIGURE 12CH

ACGAGGGGAAACCTCCTCAGAGCCTGCAGCCAGCCACGCGCCAGCATGTCTGGGGGC 

AAATACGTAGACTCCGAGGGACATCTCTACACTGTTCCCATCCGGGAACAGGGCAACA

TCTACAAGCCCAACAACAAGGCCATGGCAGACGAGGTGACTGAGAAGCAAGTGTATG 

ACGCGCACACCAAGGAGATTGACCTGGTCAACCGCGACCCCAAGCATCTCAACGACG 

ACGTGGTCAAGATTGACTTTGAAGATGTGATTGCAGAACCAGAAGGGACACACAGTTT 

CGACGGCATCTGGAAGGCCAGCTTCACCACCTTCACTGTGACAAAATATTGGTTTTAC 

CGCTTGTTGTCTACGATCTTCGGCATCCCAATGGCACTCATCTGGGGCATTTACTTTGC

CATTCTCTCCTTCCTGCACATCTGGGCGGTTGTACCGTGCATCAAGAGCTTCCTGATTG 

AGATTCAGTGCATCAGCCGCGTCTACTCCATCTACGTCCATACCTTCTGCGATCCACTC 

TTTGAAGCTATTGGCAAGATATTCAGCAACATCCGCATCAGCACGCAGAAAGAGATAT 

GAGGGACATTTCAAGGATGAAAGGTTTTTTTCCCCCCTTACTATTTCCTTGGTGCCAAT 

TCCAAGTTGCTCTCGCAGCAGCAAATTTATGAATGGTTTGTCTTGATC 

(SEQ ID NO. 89) 

FIGURE 12CI 

MECL Y YFLGFLLLAARLPI.l)AAKRFFIDV1XiNERrSAYMRF;HNQLNGWSSDENDWN^KL 

YPVWKRGDMRWKNSWKGGRVQAVLTSDSPALVGSNITFAVNLIF 

PRCQKEDANGNIVYEKNCIlNEAGLSADI'YVYNWTAWSEDSDGENGTGQSmNVFPDGK
PFPHHPGWRRWNFIYWHTLGQYFQ^
YVPIAQVKDVYVVTDQIPWVTMFQKh^ 

YKWSFGDNTGLFVSTNI ITVNHTYVLNGTFSLNLTVKAAAPGPCPPPPPPPRPSKPTPSI.GP 
AGDNPLEl,SRO>DENCQIhmYGHFQATITIVEG^ 

QGSIPTEVCTIISnPTCEITQNTVCSPVDVDEMCLLTVIlRTFNGSGTYCVNITLGDDTSLAL 

TSTUSWDRDPASPLRMANSAU^ 

SVFLNRAKAVFFPGNQEKDPLLKNQEFKGVS 

(SEQ ID NO. 90) 

FIGURE 12BT 

1 CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT
CAGAGTTAAA 

61 CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT 
TCCTGGGATT 

121 TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG 
ATGTGCTGGG 

181 CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT 
GGTCTTCTGA 

241 TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA 
TGAGGTGGAA

301 AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC 
CAGCCCTCGT 

361 GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA 
AGGAAGATGC 

421 CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT 
CTGCTGATCC 

481 ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG 
GCACCGGCCA 

541 AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG 
GATGGAGAAG 



WO 97/18454

PCT/US96/18567

601 ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT
TGGGACGATG

661 TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC
TCATGGAAGT

721 GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA
AAGATGTGTA

781 CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG
ATCGAAATTC

841 ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA
TTCATGATCC

901 TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG
ATAATACTGG

961 CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG
GAACCTTCAG

1021 CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC
CACCACCACC

1081 CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC
TGGAGCTGAG

1141 TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG
CCACCATCAC

1201 AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC
TGATGCCGGT

1261 GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA
GCATTCCCAC

1321 GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA
CAGTCTGCAG

1381 CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA
ATGGGTCTGG

1441 GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA
CGAGCACCCT

1501 GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA
GTGCCCTGAT

1561 CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT 
ACAAAAAACA 

1621 CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA
AAGGCCTGAG 

1681 TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA 
AGGATCCGCT 

1741 ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC 
TGAAGCTCAC 

1801 TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT 
TTCCTAAAGA 

1861 TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT 
TAAATGTCAT 

1921 TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT
GAAACTGATA 

1981 AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA 
AAGTCTTAGG 

2041 TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC 
TATTGATTAG 

2101 AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC 
ATGTAACTGT 

2161 ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC 
TGAATCCCAC 

2221 TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT 
GTGGTATGAT 

2281 GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG
AGTATTTTGG

2341 TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA 
TTTATTCCAT

2401 GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC
TCTTGTGTAT 

2461 ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA 
GACTTTCCAA 

2521 AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC 
TCTGTTTGTA 

2581 ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC 
CTGAAAAATA 

2641 AAGTGTGGGA AGAGACAAAA AAAAAAAAA //

(SEQ ID NO. 91) 

FIGURE 12CJ 

AAGGTGAAAGATGTGTATGTGATAACAGATCAGATCCCTGTATTCGTGACCATGTCCC 
AGAAGAATGACAGGAACTTGTCTGATGAGATCTTCCTCAGAGACCTCCCCATCGTCTT 
CGATGTCCTCATTCATGATCCCAGCCACTTCCTCAACGACTCTGCCATTTCCTACAAGT 
GGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACTTTGAATCACAC 
TTATGTGCTCAATGGAACCTTCAACCTTAACCTCACCGTGCAAACTGCAGTGCCCGGG 
CCATGCCCTCCCCCTTCGCCTTCGACTCCGCCTCCACCTTCGTA 

(SEQ ID NO. 92) 

FIGURE 12CK 

AAGGTGAAAGATGTGTATGTGATAACAGATCAGATCCCTGTATTCGTGACCATGTCCC 
AGAAGAATGACAGGAACTTGTCTGATGAGATCTTCCTCAGAGACCTCCCCATCGTCTT 
CGATGTCCTCATTCATGATCCCAGCCACTTCCTCAACGACTCTGCCATTTCCTACAAGT 
GGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACTTTGAATCACAC 
TTATGTGCTCAATGGAACCTTCAACCTTA

(SEQ ID NO. 93) 

FIGURE 12CL 

AAGGTGAAAGATGTGTATGTGATAACAGATCAGATCCCTGTATTCGTGACCATGTCCC 
AGAAGAATGACAGGAACTTGTCTGATGAGATCTTCCTCAGAGACCTCCCCATCGTCTT
CGATGTCCTCATTCATGATCCCAGCCACTTCCTCAACGACTCTGCCATTTCCTACAAGT
GGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACTTTGAATCACAC 
TTATGTGCTCAATGGAACCTTCAACCTTAACCTCACCGTGCAAACTGCAGTGCCCGGG 
CCATGCCCTCCCCCTTCGCCTTCGACTCCGCCTCCACCTTCGTA (SEQ ID NO. 94) 

FIGURE 12CM 

TACGAAGGTGGAGGCGGAGTCGAAGGCGAAGGGGGAGGGCATGGCCCGGGCACTGCA
GTTTGCACGGTGAGGTTAAGGTTGAAGGTTCCATTGAGCACATAAGTGTGATTCAAAG 
TGTGATTGTTGGAGACAAACAGGCCAGTGTTGTCCCCAAAGTTCCACTTGTAGGAAAT 
GGCAGAGTCGTTGAGGA 

(SEQ ID NO. 95) 

FIGURE 12CN 

AAGGTGAAAGATGTGTATGTGATAACAGATCAGATCCCTGTATTCGTGACCATGTCCC 
AGAAGAATGACAGGAACTTGTCTGATGAGATCTTCCTCAGAGACCTCCCCATCGTCTT 
CGATGTCCTCATTCATGATCCCAGCCACTTCCTCAACGACTCTGCCATTTCCTACAAGT
GGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACTTTGAATCACAC 
TTATGTGCTCAATGGAACCTTCAACCTTAACCTCACCGTGCAAACTGCAGTGCCCGGG 
CCATGCCCTCCCCCTTCGCCTTCGACTCCGCCTCCACCTTCGTA 

(SEQ ID NO. 96) 

FIGURE 12CO

RRWRSRRi^GRAWPGHCSLHGEVKVEGSIEHISVIQSVIVGDKQASVVPKVPLVGNGRV
VEEVAGIMNEDIEDDGEVSEEDLIRQVPVILLGHGHEYRDLICYHIHIFHL

(SEQ ID NO. 97)

FIGURE 12CP 

KVKDVYVITDQIPVFVTMSQKNDRNLSDEIFLRDLPIVFDVLIHDPSHFLNDSAISYKWNFG
DNTGLFVSNNHTLNHTYVLNGTFNLNLTVQTAVPGPCPPPSPSTPPPPS (SEQ ID NO. 98)

FIGURE 12CQ

YEGGGGVEGEGGGHGPGTAVCTVlliJ^LKV 

LRKWLGS*MRTSKTMGRSLRKISSDKI^SFFWi3MVTNTGI*SVITYTSFT (SEQ ID NO. 99) 
FIGURE 12CR 

MECIA'YFiriFLLLAARLPIJDAAKRFHD^GNKRPSAYMREHNQI.NC5WSSDENDWNEKL 
\TVWKRGDMRWKNSWKGGRVQAVi;^ 

K N C R N E A G I, S A D P Y V Y N W T A W S E D S D G E N G T G Q S H 1 1 N V F P D G K 
PFPHHPGWT^WNFIYVFHTIXiQYFQ 

YVPIAQVKDVYVVTDQIPVFVT^OKNDRNSSDETFLKDLPIMFDVLrmPSFrFLNYSTIN 
YKWSFGDNTGLFVSTNI-ITNTNHTY VI.NGTFSLNLTVKAAAPGPCPPPPPPPRPSKPTPSLGP 
AGDNPLELSRJPDENCQIN^YGHFQATITIVEGLL^ 

QGSIPTEVCTIISDPTCEITQNTVCSPVDVDEMCLLTVRRTFNGSGTYCVNLTLGODTSLAL 
TSTLISVPDRDPASPLRMANSALISVGCLAtf^^ 

SVFLNRAKAVFFPGNQEKDPLLKNQEFKGVS (SEQ ID NO. 100) 

FIGURE 12CS 

1 CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT 
CAGAGTTAAA 

61 CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT 
TCCTGGGATT 

121 TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG
ATGTGCTGGG 

181 CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT 
GGTCTTCTGA

241 TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA
TGAGGTGGAA 

301 AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC 
CAGCCCTCGT 

361 GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA 
AGGAAGATGC

421 CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT
CTGCTGATCC

481 ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG
GCACCGGCCA

541 AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG
GATGGAGAAG

601 ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT
TGGGACGATG

661 TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC
TCATGGAAGT

721 GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA
AAGATGTGTA

781 CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG
ATCGAAATTC

841 ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA
TTCATGATCC

901 TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG
ATAATACTGG

961 CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG
GAACCTTCAG

1021 CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC
CACCACCACC

1081 CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC
TGGAGCTGAG

1141 TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG
CCACCATCAC

1201 AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC
TGATGCCGGT

1261 GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA
GCATTCCCAC

1321 GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA
CAGTCTGCAG

1381 CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA 
ATGGGTCTGG 

1441 GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA 
CGAGCACCCT 

1501 GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA 
GTGCCCTGAT 

1561 CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT
ACAAAAAACA 

1621 CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA 
AAGGCCTGAG 

1681 TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA 
AGGATCCGCT 

1741 ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC 
TGAAGCTCAC 

1801 TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT 
TTCCTAAAGA 

1861 TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT 
TAAATGTCAT 

1921 TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT 
GAAACTGATA 

1981 AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA 
AAGTCTTAGG 

2041 TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC
TATTGATTAG 

2101 AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC 
ATGTAACTGT 

2161 ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC 
TGAATCCCAC

2221 TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT
GTGGTATGAT 

2281 GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG 
AGTATTTTGG 

2341 TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA 
TTTATTCCAT 

2401 GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC 
TCTTGTGTAT 

2461 ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA 
GACTTTCCAA 

2521 AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC
TCTGTTTGTA 

2581 ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC 
CTGAAAAATA 

2641 AAGTGTGGGA AGAGACAAAA AAAAAAAAA // (SEQ ID NO. 101)

FIGURE 12FN 

MECLYYFLGFLLI.AARLPLDAAKRFHDVLGN1£RPSAYMREI INQLNGWSSDENDWNEKL 

YP\WKRGDMRWK>JSWKGGRVQAVLTSDSP^^ 

KNCRNEAGLSADPYVYTsWTAWSEDSDGENGTGQSHHNWPDG^ 

VFHTLGQYFQKLGRCSVRVSVNTANVTLGPQL^VTVYRRI-IGRAYWIAQVKDVYVVT 

DQIPVKVTMFQKNDRNSSDETFLKDLPIMFDVl,^^ 

TNHTVNHTYVLNGTFSLNn^TVXAAAPGPCPPPPPJTRPSKPTPSLGPAGDNPLELSRJPDEN 
CQINRYGHFQATmVEGILEN^QMTDVI^ 

CEITONTVCSPVnvOEMCLLTVRJ^TFNGSGTYCVNI JLGDOTSLALTSTLISVPDRDPASP 
L R MA N S A L I S V G C L A IF VT VISE L V YKKHKE YN1 > I EN SI 'G N V VR S K G L S VFLNRAK A VF FP G 
NQEKDPLLKNQEFKGVS* (SEQ ID NO. 102)



FIGURE 12CT

CTGACCAGGAACCCACTCTTCTGTGCATGTATGTGAGCTGTGCAGAAGTATCTGGCTG
GGAACTGTTGTTCTCTAAGGATTATTGTAAAATGTATATCGTGGCTTAGGGAGTGTGG 
TTAAA'rAGCATTTTAGAGAAGAAAAAAAAAAAAAA.AAAAACTC(iAGAGTACTTCTAG 
AGCGGCCGCC.GCGCCATCGATTTTCCACCCGGGTGGGGTACCAGGTAAGTGTACCCAA 
TTCGCCTATAGTGAGT (SEQ ID NO. 103)

FIGURE 12CU 

AGGACAAGCCAAGGACACTCTAAGTCTTTGGCCTTCCCTCTGACCAGGAACCCACTCT 
TCTGTGCATGTATGTGAGCTGTGCAGAAGTATGTGGCTGGGAACTGTTGTTCTCTAAG 
GATTATTGTAAAATGTATATCGTGGCTTAGGGAGTGTGGTTAAATAGCATTTTAGAGA 
AGACATGGGAAGACTTAG'I'GTTTCTTCCCATCTGTATTGTGGTT'ITTACACTGTTCGTG 
GGGTGGACACGCTGTGTCTGAAGGGGAGGTGGGGGTCACTGCTACTTAAGGTCCTAGG 
TTAACTGGGGGAGATACCACAGATGCTCAGCTTTCCACATAACATGGGCATGAACCAG 
CTAATCACACTGAA (SEQ ID NO. 104) 

FIGURE 12CV 

GGATCCTTCTCCTGGTCTCCTCGGAAGAACGGGGCTTTCGCGTGACTGAGGAGAACAC 
TCAGGCCCTTGCCCTTGACCGTGTTCCTGGGGCAGTTTCCTATTGGCTTGTACGCCTTG
TGTTTTTTGTACAGCAAGATGGTAACCATGGTGACAAGCACAGCCAGGCAGCCGATGG 
AGATCAGGACACCATTCACTGCTCTCAGAGGGAGTCTGGGTCTTTGCCAGGGATAGAG 
ATCAGGGTGCTGGTGAGGGCCAGGCTTCGATCATCTCCCAGAGTGAAATTCACACAGT 
AGGTGCCAGACCCATTGAAGGCTCTTCTCACAGACAGCAGCACAGCCCATCCACAGCC 
ACAGGGCTGCAGACCCGGTTCTGGGCGATCTGGCAGGTGGGGTCGGAGATGATCGTA 
CAGGCTTCCATGGGGGTGGCCCCTTTGCAGGTCACAGTGAAGTCCATCAGGGAGTTGG 
CAGGCTGCGGTGTGGGCATGGGGACATCTGCTATCTGCATGATGCTGACTTCCAGGATCC

(SEQ ID NO. 105) 

FIGURE 12CW

TAGCAGATGTCCCCATGCCCACACCGCAGCCTGCCAACTCCCTGATGGACTTCACTGT 
GACCTGCAAAGGGGCCACCCCCATGGAAGCCTGTACGATCATCTCCGACCCCACCTGC 
CAGATCGCCCAGAACCGGGTCTGCAGCCCTGTGGCTGTGGATGGGCTGTGCTGCTGTC
TGTGAGAAGAGCCTTCAATGGGTCTGGCACCTACTGTGTGAATTTCACTCTGGGAGAT
GATCGAAGCCTGGCCCTCACCAGCACCCTGATCTCTATCCCTGGCAAAGACCCAGACT

CCCTCTGAGAGCAGTGAAT (SEQ ID NO. 106)

FIGURE 12CX

GGATCCTTCTCCTGGTCTCCTCGGAAGAACGGGGCTTTCGCGTGACTGAGGAGAACAC
TCAGGCCCTTGCCCTTGACCGTGTTCCTGGGGCAGTTTCCTATTGGCTTGTACGCCTTG
TGTTTTTTGTACAGCAAGATGGTAACCATGGTGACAAGCACAGCCAGGCAGCCGATGG
AGATCAGGACACCATTCACTGCTCTCAGAGGGAGTCTGGGTCTTTGCCAGGGATAGAG
ATCAGGGTGCTGGTGAGGGCCAGGCTTCGATCATCTCCCAGAGTGAAATTCACACAGTA

(SEQ ID NO. 107)

FIGURE 12CY

TTTTTTTTTTTTTTTTTTAGACTGCCTTTTTAATGAGTAGAATATGTACACACACGCACC
ATACACAAAGCCCGGGCCCATTATAATTTTGTCAGGAGCTCAGGCATGCTCAGTGAGT
TGGAAGGCAGATGAAGCATG 

CCTTCAGGTGGTGATTAGCTGGGTTCATGCCCATGTTATCGTGGAAAGCTGAGGCATC 
TGTGGTATCTCCCCCAGTTAACCTAGGACCTTAAGTAGCAGTGACCCACCTCCCTTCAG
ACACAGCG 

(SEQ ID NO. 108)

FIGURE 12CZ 

GGATCCTGGAAGTCAGCATCATGCAGATAGCAGATGTCCCCATGCCCACACCGCAGCC
TGCCAACTCCCTGATGGACTTCACTGTGACCTGCAAAGGGGCCACCCCCATGGAAGCC 
TGTACGATCATCTCCGACCCCACCTGCCAGATCGCCCAGAACCGGGTCTGCAGCCCTG 
TGGCTGTGGATGGGCTGTGCTGCTGTCTGTGAGAAGAGCCTTCAATGGGTCTGGCACC 
TACTGTGTGAATTTCACTCTGGGAGATGATCGAAGCCT 

(SEQ ID NO. 109) 



FIGURE 12DA

TTTTTTTTTTTTTTTTTTTCTTCTCTAAAATGCTATTTAACCACACTCCCTAAGCCACCA
TATACATTTTACAATAATCCTTAGAGAACAACAGTTCCCAGCCACATACTTCTGCACA
GCTCACATACATGCACAGAAGAGTGGGTTCCTGGTCAGAGGGAAGGCCAAAGACTTA 
GAGTGTCCTTGGCTTGTCTGGAGCAATGGATCCTTCTCCTGGTCTCCTCGGAAGAACG 
GGCTTT (SEQ ID NO. 110) 



FIGURE 12DB 

AAACTGCAGTGCCCGGGCCATGCCCTCCCCCTTCGCCTTCGACTCCGCCTCCACCTTCA
ACTCCGCCCTCACCTCCGCCCTCACCTCTGCCCACATTATCAACACCTAGCCCCTCTTT
AATGCCTACTGGTTACAAATCCATGGAGCTGAGTGACATTTCCAATGAAAACTGCCGA 
ATAAACAGATATGGCTACTTCAGAGCCACCATCACAATTGTAGAGGGGATCCTGGACG 
XAGCATCATGCAGATAGCAGATGTCCCATGCCCACACCGCAGCCGTCCAACTCCTGAT 
GGACTTCACTGTGACCTCAAGGGCACCCATGGAAGCTGTCAGA (SEQ ID NO. 111)



FIGURE 12DC 

CCTCAACGACTCTGCCATTTCCTACAAGTGGAACTTTGGGGACAACACTGGCCTGTTT 
GTCTCCAACAATCACACTTTGAATCACACTTATGTGCTCAATGGAACCTTCAACCTTAA 
CCTCACCGTGCAAACTGCAGTGCCCGGGCCATGCCCTCCCCCTTCGCCTTCGACTCCGC 
CTCCACCTTCAACTCCGCCCTCACCTCCGCCCTCACCTCTG (SEQ ID NO. 112)

FIGURE 12DD 

CCTCAACGACTCTGCCATTTCCTACAAGTGGAACTTTGGGGACAACACTGGCCTGTTT 
GTCTCCAACAATCACACTTTGAATCACACTTATGTGCTCAATGGAACCTTCAACCTTAA
CCTCACCGTGCAAACTGCAGTGCCCGGGCCATGCCCTCCCCCTTCGCCTTCGACTCCGC
CTCCACCTTCAACTCCGCCCTCACCTCCGCCCTCACCTCTGCCCACATTATCAACACCT 
AGCCCCTCTTTAATGCCTACTGGTTACAAATCCATGGAGCTGAGTGACATTTCCAATG 
AAAACTGCCGAATAAACAGATATGGCTACTTCAGAGCCACCATCACAATTGTAGAGG 
GGATCCTGGAAGTGAGCATCATGCAGATAGCAGATGTCCCCATGCCCACACCGCAGCC 
TGCCAACTCCCTGATGGACTTCACTGTGACCTGCAAAGGGGCCACCCCCATGGAAGCC
TGTACGATCATCTCCGACCCCACCTGCCAGATCGCCCAGAACCGGGTCTGCAGCCCTG

(SEQ ID NO. 113)



FIGURE 12DE

GGATCCCCTCTACAATTGTGATGGTGGCTCTGAAGTAGCCATATCTGTTTATTCGGCAG
TTTTCATTGGAAATGTCACTCAGCTCCATGGATTTGTAACCAGTAGGCATTAAAGAGG
GGCTAGGTGTTGATAATGTGGGCAGAGGTGAG 

GTGGAGGCGGAGTCGAAGGCGAAGGGGGAGGGCATGGCCCGGGCACTGCAGTTTGCA
CGGTGAGGTTAAGGTTGAAGGTTCCATTGAGCACATAAGTGTGATTCAAAGTGTGATT

GTTGGAGACAAACAGGCCAGTGTTGTCCCAAAGTTCCACTTGTAGGAATGGCAGAGTC

GTTGAGG (SEQ ID NO. 114) 



FIGURE 12DF 

CCTCAACGACTCTGCCATTTCCTACAAGTGGAACTTTGGGGACAACACTGGCCTGTTT 
GTCTCCAACAATCACACTTTGAATCACACTTATGTGCTCAATGGAACCTTCAACCTTAA
CCTCACCGTGCAAACTGCAGTGCCCGGGCCATGCCCTCCCCCTTCGCCTTCGACTCCGC 
CTCCACCTTCAACTCCGCCCTCACCTCCGCCCTCACCTCTGCCCACATTATCAACACCT 
AGCCCCTCTTTAATGCCTACTGGTTACAAATCCATGGAGCTGAGTGACATTTCCAATG 
AAAACTGCCGAATAAACAGATATGGCTACTTCAGAGCCACCATCACAATTGTAGAGG 
GGATCCTGGAAGTCAGCATCATGCAGATAGCAGATGTCCCCATGCCCACACCGCAGCC 
TGCCAACTCCCTGATGGACTTCACTGTGACCTGCAAAGGGGCCACCCCCATGGAAGCC
TGTACGA (SEQ ID NO. 115) 

FIGURE 12DG 

GAAGGTGGAGGCGGAGTCGAAGGCGAAGGGGGAGGGCATGGCCCGGGCACTGCAGTT
TGCACGGTGAGGTTAAGGTTGAAGGTTCCATTGAGCACATAAGTGTGATTCAAAGTGT
GATTGTTGGAGACAAACAGGCCAGTGTTGTCCCCAAAGTTCCACTTGTAGGAAATGGC
AGAGTCGTTGAGGAAGTGGCTGGGATCATGAATGAGGACATCGAAGACGA 

(SEQ ID NO. 116)

FIGURE 12DH

GAATTCGGCACGAGGGGAGTCAGAGTCAAGCCCTGACTGGTTGCAGGCGCTCGGAGTC

AGCATGGAAAGTCTCTGCGGGGTCCTGGGATTTCTGCTGCTGGCTGCAGGACTGCCTC 

TCCAGGCTGCCAAGCGATTTCGTGATGTGCTGGGCCATGAACAGTATCCCGATCACAT 

GAGAGAGCACAACCAATTACGTGGCTGGTCTTCGGATGAAAATGAATGGGTTCCAATA

TCACTTTTGTGGTGAA (SEQ ID NO. 117) 

FIGURE 12DI 

GAATTCGGCACGAGGAAGGAGGCCGTGTGCAGGCAGTCCTGACCAGTGACTCACCGG 
CTCTGGTGGGTTCCAATATCACTTTTGTGGTGAACCTGGTGTTCCCCAGATGCCAGAAG 
GAAGATGCTAATGGCAATATCGTCTATGAGAAGAACTGCAGGAATGATTTGGGACTG
ACATCTGACCTGCATGTCTACAACTGGACTGCAGGGGCAGATGATGGTGACTGGGAAG 
ATGGCACCT (SEQ ID NO. 118) 



FIGURE 12DJ

GAAGGTGGAGGCGGAGTCGAAGGCGAAGGGGGAGGGCATGGCCCGGGCACTGCAGTT 
TGCACGGTGAGGTTAAGGTTGAAGGTTCCATTGAGCACATAAGTGTGATTCAAAGTGT 
GATTGTTGGAGACAAACAGGCCAGTGTTGTCCCCAAAGTTCCACTTGTAGGAAATGGC
AGAGTCGTTGAGGAAGTGGCTGGGATCATGAATGAGGACATCGAAGACGATGGGGAG 
GTCTCTGAGGAAGATCTCATCAGACAAGTT (SEQ ID NO. 119)

FIGURE 12DK 

GAATTCGGCACGAGGTCAAGCCCTGACTGGTTGCAGGCGCTCGGAGTCAGCATGGAA 
AGTCTCTGCGGGGTCCTGGGATTTCTGCTGCTGGCTGCAGGACTGCCTCTCCAGGCTGC 
CAAGCGATTTCGTGATGTGCTGGGCCATGAACAGTATCCCGATCACATGAGAGAGCAC 
AACCAATTACGTGGCTGGTCTTCGGATGAAAATGAATGGATGAACACCTTGTATCCA

(SEQ ID NO. 120) 

FIGURE 12DL

AAGGGGGAGGGCATGGCCCGGGCACTGCAGTTTGCACGGTGAGGTTAAGGTTGAAGG
TTCCATTGAGCACATAAGTGTGATTCAAAGTGTGATTGTTGGAGACAAACAGGCCAGT 
GTTGTCCCCAAAGTTCCACTTGTAGGAAATGGCAGAGTCGTTGAGGAAGTGGCTGGGA 
TCATGAATGAGGACATCGAAGACGATGGGGAGGTCTCTGAGGAAGATCTCATCAGAC 
AAGTTCCTGTCATTCTTCTGGGACATGGTCACGAATACAGGGATCTGATCTGTTAT 

(SEQ ID NO. 121)

FIGURE 12DM 

GAATTCGGCACGAGCCGACACTGTGACTCCTGGTGGATGGGACTGGGGAGTCAGAGT 
CAAGCCCTGACTGGTTGCAGGCGCTCGGAGTCAGCATGGAAAGTCTCTGCGGGGTCCT
GGGATTTCTGCTGCTGGCTGCAGGACTGCCTCTCCAGGCTGCCAAGCGATTTCGTGAT 
GTGCTGGGCCATGAACAGTATCCCGATCACATGAGAGAGCACAACCAATTA

(SEQ ID NO. 122)

FIGURE 12DN 

AAGGTGAAAGATGTGTATGTGATAACAGATCAGATCCCTGTATTCGTGACCATGTCCC
AGAAGAATGACAGGAACTTGTCTGATGAGATCTTCCTCAGAGACCTCCCCATCGTCTT 
CGATGTCCTCATTCATGATCCCAGCCACTTCCTCAACGACTCTGCCATTTCCTACAAGT 
GGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACTTTGAATCACAC 
TTATGTGCTCAATGGAACCTTCAACCTTAACCTCACCGTGCAAACTGCAGTGCCCGGG 
CCATGCCCTCCCCCTTCGCCTTCGACTCCGCCTCCACCTTCGTA (SEQ ID NO. 123)

FIGURE 12DO

TACCATCGGAGAAAGAAGACCAAGCAAGGCTCAGGCAGCCACCGCCTGCTTCGCACT 
GAGC'CTCCTGACTCAGACTCAGAGTCCAGCACAGACGAAGAGGAATTTGGAGAATTG 
GAAATCGCTCTCGTTTTGTCAAGGGAGACTATCCCGATGCTGCAAGATCTGCTGTCCCT 
CTGGCCTTTGTCATCCTCGCGCCTGCGTTGTGGCCTCTGTGGGCTTGGTGTGGAGCAAA 
TGGCTCTCAAGGAGGACTGAGTCTCAAGGAAATT (SEQ ID NO. 124)



FIGURE 12DP

AGCTAAGGTCAGGAGGTGTCTGAAGAATTGGCTGATGCATGGCAGGGATGTTGTTGAC
CTGCTTTTAGAACAATACTTCCATTTAATTATAGCATATCTTATGTGTGTATTAAAGCA 
GAGCCGATCTGGTGGGGCTCATTAAGTAAATGTACTTACTGCAAAAGGTTCAACTGGT
GACCCCAGTTTTCCCCAGAAGCAATATGATAGGACAGAGGCGACTCCTGCAAGTTGTC
TCAGACTTCACACATACATTGTGACATTCTCTGAGCATGTGCACTGTACATGATATGAC 
ACTATCAA (SEQ ID NO. 125) 

FIGURE 12DQ 

AGCTAAGGTCCACTACCTTGTGAAGATGTATAAACACCTGAAATGTAGAAGCGATCCG 
TATGTCAAGATCGAGGGGAAGGACGCTGACGACTGGCTGTGTGTGGACTTTGGGAGTA 
TGGTGATCCATTTGATGCTTCCAGAAACCAGAGAAACCTATGAATTAGAGAAACTATG 
GACTCTACGTTCTTTTGATGACCTTAGCTAAGCCGAATCAGCACACTGGCGGCGTTACT
AGTGGATCGAGCTCGTACAGCTGATGCATAGCTTGAGTATCTATAGGTTACTAATAGC 
TGGCTATCATGTCAAGCGTTC (SEQ ID NO. 126) 

FIGURE 12DR 

GCTGAGCTGCAGAGAGTAGCACATCCTTGCTAATTCAATAACTACCAGTTTTTATTGGT 
GAAACATGAATCCAGATGGTATGGTTGCTCTCCTGGACTACCGTGAAGATGGTGTGAC 
TCCATTCATGATTTTCTTTAAGGATGGCTTAGAGATGGAGAAATGTTAACAAATTGGA 
TCTATCACCTGTCACCATAATTGGCTGCTGCTTACCATCCATACAACACCAGGACTTAG 
GACAAATGGGACTGATGTCATCTTGAGCTTTTATTTTGACCTTAGCT 

(SEQ ID NO. 127) 

FIGURE 12FO 

AGCTAAGGTCAGAGCCAATAGTATCATGAGAACTGAAGAAGTAATAAAGCAACTTCT 
CCAGAAATTTAAGATTGAGAATAGCCCTCGGGATTTCGCTCTTTACATTATTTTTGGGA 
CAGGAGAGCAGAGAAAGCTAAAGAAGACCGATGTCCACTGCTGCAGAGGTTACTACA 
AGGACCATCCAAAAGCAATGCTCGGATCTCTCATGGATAAAGATGCAGAAGAATCAC 
GAGAGATGTGGCTCGTACATTATTTCACTTTCTTCTGATCATAC'rCA.AGATAGATGAGA 
GAGAAT (SEQ ID NO. 128)

FIGURE 12DS

TTGACTTCTGACiTCTAACACAGACACTGCAAGGGTTAATTTTCCAAGAGG'rGGTTGTr 
GTTGACGATAAATTCATTAAGAATTTTTAAAAATT'rAGTI'AGATTTACCAAAGTCACTG 
GAGACAAATTCAGAAGGCATATATACC'rGCCAGTTTTGTGGACTACATTAATAGGGAG 
GCTTTTATGTTTGATGTAATTCTTACAGTTCTAAGAATTAAGTTCCATTGCATGAGACC 
TTAGCT (SEQ ID NO. 129)



FIGURE 12DT 

AAGGTGAATCCCCGACGGCTCTGGGCCCGAGGAGAAGCGTCGCCGTGGCAAATTGGC 
ACTCiCAGGAGAAGCCCTCCACAGGTACTTGCiAAAAACTGGTCTCTGAGGCCAAGGCC 
AGCl'CCGAGACATTCACJGACTTCTGGATCAGCCTCCAGGGACACTGTGCAGTGAGAAG 
ATGGCCATGAGTCCTGCCAGTGAG (SEQ ID NO. 130)



FIGURE 12DU 

AATTTTTTTTTTCGACGGCCCAACGGGGGCTTGGTGGATGGAAATATGGTTTTGTGAGT 
TATTGCACTACCTGGAATATCTATGCCTCTTATTTGCGTGTACTGTTGCTGCTGATCGT 
TTGGTGCTGTGTGAGTGAACCTATGGCTTAGAAAAACGACTTTGTCTTAAACTGAGTG
GGTGTTCAGGG (SEQ ID NO. 131) 



FIGURE 12DV 

CACCTGATTTAAAGGAAAAGCATTCTGACGTAAGAAGCTGAAAGGCGGCCCTTGCGTG 
CTTTGAACTTTCTTATACAGCACAGTCATCTGAAGCTTCCTGTGTGACCAAGACAAGA
ACGCGTGCACAAGACTGAGAAACAGCAAGAAACAACCCGGCATTCTACTTTCTCAAC
ACTATCATACTTTAAACCTTTCAC (SEQ ID NO. 132)

FIGURE 12DW

CTAGCTTACGCTAGTCCCCCATGCATAAAGACTGATCGCTTTTCCTTAGAAAGGTGAG 
AGGGTTAGGACAAGGCCGTGTGGTAACAACACCCGCAGCTCGAAAAACCAATGGCTT 
GTTAACGTGTCAGTGAGGCACTGTACGGACGTCCATAGTCCACATCTTCAAATTCCCG 
CAGAAGGCTTCCTATTCTTAAACTCTA

(SEQ ID NO. 133) 

FIGURE 12DX 

CTACATTTCTGTATCCATTCCTCTGTTGAAGGCTCTGGTTCTTTCCAGCTTCTGGCTATT
ATAAATAAGGCTGCTATAAACACAGTGGAGGCATGTGTCCTTGTTATATTTTGGAGCA
TCTTTTGGGTATATGCCCAGAAGTGCTATAGCTGGTTCCTCAGGTAGTACTATGTCGAA
TTTTCTGAGGAACTGCCAGACTGATTTCCAGAGTGGTTGTACCAGCTTGCAATCCCACC
AGCAATAGAGGAGTGTTCCTCTTTCTCTATATTCTTGCCAACATCTGCTGTCACCTGAG 
TGTTT (SEQ ID NO. 134) 

FIGURE 12DY 

TGGTAAAGGGGGAATGATGTCGAGGCCATCCTGGGCTGTAGAGCCAGGCCCTGGCTTG 
GGGAGTGGGCATTGTTAACTTGTTGCTGACTTTGTGTTGACCCCTGCATCAGCAACTAT 
TTCCTTAAATCCAGGATACAACTTGTTAAGTGTGACAGCTTTCCTTTACACACCATTTT
TGTGGGTGTATATATATATTTGACTTGGGGAGAATTATTTTTTACAAAAATACAAAAT 
AGCTTTTAA (SEQ ID NO. 135) 

FIGURE 12DZ 

AGCTAAGGTCCGGACTCTATGGCATGACCCCAAAAACATTGGCTGGAAAGATTACACT 
GCCTACAGGTGGCACCTGATTCACAGGCCTAAGACAGGCTACATGAGAGTCTTAGTGC
ATGAAGGAAAGCAAGTCATGGCTGACTCAGGACCAATTTATGACCAAACCTACGCTG
GTGGACGGCTGGGCTGTTTGTCTTCTCCAAGAGATGGTCTATTCTCGGACCTCAAGTAT
GAGTGCAGAGATGCTAGAGAGCAGGCTCAGTCTCAGCA 

(SEQ ID NO. 136) 



FIGURE 12EA

TGACCTACGTGTAGTTGGTGTGCTTGTTGTCGAAGATGAGGGCCTCCTGGATGAGCTG
GTGCTGCTGCTCCAGCAGGTCCAGGCTGGGCTTGTAGTCCACGAGTCTGCGCTCGTAC
TGCTTCAGGTGGCTCAGCTGGTCTTCCAGAGTCCCGTTCATCTCAATGGAGATGCGCCC
GATCTCCTCCATCTTAGTCTGGATCCACGGCCCCACCATATTGGCTTGGCTGGCGAACT 

GTCGGCGAAGGCTGCATTGGATTGCT 

(SEQ ID NO. 137)

FIGURE 12EB 

AATTTTTTTTTTCGACGGCCCAACGGGGGCTTGGTGGATGGAAATATGGTTTTGTGAGT
TATTGCACTACCTGGAATATCTATGCCTCTTATTTGCGTGTACTGTTGCTGCTGATCGT
TTGGTGCTGTGTGAGTGAACCTATGGCTTAGAAAAACGACTTTGTCTTAAACTGAGTG 
GGTGTTCAGGG (SEQ ID NO. 138) 

FIGURE 12EC

CACCTGATTTAAAGGAAAAGCATTCTGACGTAAGAAGCTGAAAGGCGGCCCTTGCGTG 
CTTTGAACTTTCTTATACAGCACAGTCATCTGAAGCTTCCTGTGTGACCAAGACAAGA 
ACGCGTGCACAAGACTGAGAAACAGCAAGAAACAACCCGGCATTCTACTTTCTCAAC 
ACTATCATACTTTAAACCTTTCAC 

(SEQ ID NO. 139) 

FIGURE 12ED 

CTAGCTTACGCTAGTCCCCCATGCATAAAGACTGATCGCTTTTCCTTAGAAAGGTGAG 
AGGGTTAGGACAAGGCCGTGTGGTAACAACACCCGCAGCTCGAAAAACCAATGGCTT 
GTTAACGTGTCAGTGAGGCACTGTACGGACGTCCATAGTCCACATCTTCAAATTCCCG 
CAGAAGGCTTCCTATTCTTAAACTCTA 

(SEQ ID NO. 140) 

FIGURE 12EE

CTACATTTCTGTATCCATTCCTCTGTTGAAGGCTCTGGTTCTTTCCAGCTTCTGGCTATT
ATAAATAAGGCTGCTATAAACACAGTGGAGGCATGTGTCCTTGTTATATTTTGGAGCA
TCTTTTGGGTATATGCCCAGAAGTGCTATAGCTGGTTCCTCAGGTAGTACTATGTCGAA
TTTTCTGAGGAACTGCCAGACTGATTTCCAGAGTGGTTGTACCAGCTTGCAATCCCACC
AGCAATAGAGGAGTGTTCCTCTTTCTCTATATTCTTGCCAACATCTGCTGTCACCTGAG 
TGTTT (SEQ ID NO. 141) 

FIGURE 12EF

TGGTAAAGGGGGAATGATGTCGAGGCCATCCTGGGCTGTAGAGCCAGGCCCTGGCTTG 
GGGAGTGGGCATTGTTAACTTGTTGCTGACTTTGTGTTGACCCCTGCATCAGCAACTAT 
TTCCTTAAATCCAGGATACAACTTGTTAAGTGTGACAGCTTTCCTTTACACACCATTTT
TGTGGGTGTATATATATATTTGACTTGGGGAGAATTATTTTTTACAAAAATACAAAAT 
AGCTTTTAA (SEQ ID NO. 142) 

FIGURE 12EG 

AGCTAAGGTCCGGACTCTATGGCATGACCCCAAAAACATTGGCTGGAAAGATTACACT 
GCCTACAGGTGGCACCTGATTCACAGGCCTAAGACAGGCTACATGAGAGTCTTAGTGC
ATGAAGGAAAGCAAGTCATGGCTGACTCAGGACCAATTTATGACCAAACCTACGCTG 
GTGGACGGCTGGGCTGTTTGTCTTCTCCAAGAGATGGTCTATTCTCGGACCTCAAGTAT 
GAGTGCAGAGATGCTAGAGAGCAGGCTCAGTCTCAGCA 

(SEQ ID NO. 143) 

FIGURE 12EH 

TGACCTACGTGTAGTTGGTGTGCTTGTTGTCGAAGATGAGGGCCTCCTGGATGAGCTG 
GTGCTGCTGCTCCAGCAGGTCCAGGCTGGGCTTGTAGTCCACGAGTCTGCGCTCGTAC 
TGCTTCAGGTGGCTCAGCTGGTCTTCCAGAGTCCCGTTCATCTCAATGGAGATGCGCCC 
GATCTCCTCCATCTTAGTCTGGATCCACGGCCCCACCATATTGGCTTGGCTGGCGAACT 
GTCGGCGAAGGCTGCATTGGATTGCT 

(SEQ ID NO. 144) 

FIGURE 12EI 

TGACCATCGATAAGTTTAATAACTACAGACTTTTCCCAAGACTACAAAAGCTTCTTGA 
AAGTGACTACTTTAGATATTACAAGGTGAACTTGAAGAAGCCTTGTCCTTTCTGGAAT
GACATCAACCAGTGTGGAAGAAGAGACTGTGCCGTCAAACCCTGCCATTCTGATGAAG
TTCCTGATGGAATTAAGTCTGCCGAGCTACAAGTATTCTG 

AGGAAGCCCAACCGCATTGAAGAATGTGAGCAAGCTGAGCG (SEQ ID NO. 145) 

FIGURE 12EJ 

AACTCTGTGAACCGTGCCTTTCTCTGTGGACKtTGGAGGTGTCGGTTGAAGACAAGCGA 
GGTCCTCCAAGGGGCTG'l Cn'CTTATGTTGCCATCTCCCCTTGTAGCTTGGCTGCCCACC 
CTCCAGACTGTGCGCCATGGCTCCAAGGCTGTGACCCGCCACTGGAGTCATGCACTTC 
CAGCGGCAGAAGCTGATGCTATAACTGAGTATATTCCTCCAAACCTGCCATCAACCCG 
AGA (SEQ ID NO. 146) 



FIGURE 12EK 

ACTTCTCCAGAGAATTTAAGATTGAGAATAGCCCTCGGGATTTCGCTCTTTACATTATT 
TTTGGGACAGGAGAGCAGAGAAAGCTAAAGAAGACCGATGTCCCACTGCTGCAGAGG 
TTACTACAAGGACCATCCAAAAGCAATGCTCGGATCTTCCTCATGGATAAAGATGCAG 
AAGAAATCAGCAGAGATGTGGCTCCGTACATTAATTTCACTTTTCTTTCTTGGATCCAT 
CCTTCAAGATTAGATGAAGAAGAGAAATGGAGATTGAGAGAATATGCAATCATACCGA 

(SEQ ID NO. 147) 

FIGURE 12EL 

AGGGTTACTTCAGGCTAAGGCAATAGAAATCCATTTTAAGATGGTGTGCTAAAGGCTT
GATGGATGTTCATCGTCTGTCTAAAGGAGAATGAAGTCATCAACAGGATGTCAGGGGA 
AAGTGAGATCATCGCAGAAAGTATCAACTTAGCACAAACACACAGGCATAGCTCCTG 
CAAGAGGTGAATGCTGTCCCCAAATACCTGAGGAACTATCCCTTTGGGCAAGAAAATA
GACAAGTCCATGAAGTCTGGGTGA

(SEQ ID NO. 148) 

FIGURE 12EM 

GACCAGGTACACTTGAGCAAAGCACCCAGTATTTAATTCCTTACAGAAAGGAGAGGA 
AAGGTCTGCAGTTGGACTGATGGTATGCTAACACCGCAAATGACTGTCATTTGATCTC
AGAAGTTCAGGATTGATTGCTATGTTTTAGCTCTAATTGTGAGAAACAGTAGTCATTTT
AGTCTTAAATTTTGCCCTCAGGAAATTCAGGGAGACTGAGCCTTCCTTCCCCCACCTTC
GTAAAGCCGAATTCCAGCACACGGCGGCCGTTACTAGTGGATCCGAGCTCG

(SEQ ID NO. 149) 

FIGURE 12EN

TACAAGGTGGGATGGCAGGAACTGAAGGCTTCTGTAAATCCAGTTTTGGCTCTCTCTC 
TGGTCTTTCTTTCTCTTCTGTTCTGTTTGGAAGGGTTTCTGGTCTTTCAGGAGGTATTTT 
TTTAATTTCATGTTTTCTCTCTGTGGTACCTGCCCCTTGTTTGACGACAGGAGCTGAIG 
GAGGTGGCGGTTTCTTGGGTCTATTCCCTTCCTTGTCAAAGTCCGATGGAAGTAACTTC 
ACGAAGTTGTCAGGAAACACGCCTCGTCTGCCATTGAGTTCTCCTTCCCACCAGCCTA 
CGCGATGCAGTCTTATTGATGAGAGTCACTATATCTCCTTA (SEQ ID NO. 150)

FIGURE 12EO 

TCACCCATGACTTCTATGGACTTGTCTATTTTCTTGCCCAAAGGGATAGTTCCTCAGGT 
ATTTGGGGACAGCATTCACCTCTTGCAGGAGCTATGCCTGTGTGTTTGTGCTAAGTTGA 
TACTTTCTGCGATGATCTCACTTTCCCCTGACATCCTGTTGATGACTTCATTCTCCTTTA 
GACAGACGATGAACATCCATCAGGCCTTTATGCACACCATCTTAAAATGGATTTCTAT 
TGCCTTAGCCTGAAGTCC (SEQ ID NO. 151) 

FIGURE 12EP 

CCCATAGAGATAGGT'ITGCTCCAGAACCTGCAGCATTTGCACATCACAGCiGAACAAGG . 
TGGACATTCTGCCAAAACAGTTGTTTAAGTGCGTGAAGTTGAGGACTTTGAACCTGGG 
GCAGAACTGTATCGCCTCCCTGCCTGAGAAAATCAGTCAGCICACCCAGCrCACTCAG 
CTGGAGCTGAAGGGCAACTCCCTAGACCGCCTGCCAGCCCAGCTGGCAGTGTCGATGC 
TCAAGAAGA (SEQ ID NO. 152) 

FIGURE 12EQ

CAATAATCCAGGTAAAATAGAGTAAAATAGTCTGCTAGCAGCAAGTTCCTACCATACT 
TTCAACAACACTCACGAGATACGGAATGATTACAGCATTAAGAATATTTCAGAAATGA 
CAGGTAGC.TGTGGTGGACAGGTGGCTCACATTCAAGACTCAAGTCTACTTAAAAAAGA
AAATCTCACTAGCACTAGATTCTAGCTCCTTTGTTTCCCCCTTTCTTTTGGTTTCAAAG
GCGTTTCTACAACCCATAAGAC.G 

(SEQ ID NO. 153)

FIGURE 12ER 

GCCAAGCTATTATGACACTATAGATACTCAACGTATCGATCAACGTTGGTACCGAGCT 
CGGATCCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGGCTTGGATTGGTCAGAGCA
GTGTGCAATATGATCCAACTAAGTCTCCTCCCTTGGCCCCTCCCCAAAATGTTTGCAGT
GTTATTTTTGTGGGTTTriTTTTAACACCCTGACACCTGTTGTGGACATTGTCAACCTTT 
GTAAGAAAACCCAAATAAAAATTGAAAAATAAAATAAAAAGAAACCCATGAACATTC 
GCACCACTTGTGGCTTCTGACTATCTTCCACAGAGGGAAGTTTAAAACCCAAACTTCC 
AAAGGTTTGAACTACCTCAAGACACTTTCGCAGTGGAGTCGTAGACCAATCCCA 

(SEQ ID NO. 154)

FIGURE 12ES 

TAAATAAATTAAAAAACTATTAAACCTAAAAACGTCCACCAAACCCTAAAACCATTAA 
ACAACCAACAAACCCACTAACAATTAAACCTAAACCTCCATAAATAGGTGAAGGCTTT 
AATGCTAACCCAAGACAACCAACCAAAAATAATGAACTTAAAACAAAAATA 

(SEQ ID NO. 155)

FIGURE 12ET 

GGTAAAGGGGACCTGGAGAACGCCTTCCTGAACCTGGTCCAGTGCATCCAGAACAAG 
CCCCTGTACTTCGCTGACCGGCTGTACGACTCCATGAAGGGCAAGGGGACTCGAGACA 
AGGTCTGATTACiAATCATGGTCTCTCGCAGTGAAGTGGACATGCTGAAAATCAGATCT 
GAATTCAAGAGGAATATGGCAAGTCCTGTACTACTACAT (SEQ ID NO. 156) 

FIGURE 12EU 

AGAGCAGCAGGCCAGCTGTACTTGGTTTGGCAAGAAAAAGAAGCAGTACAAAGATAA 
ATATTTGGCAAAGCACAACGCAGTGTTTGATCAATTAGATCTTGTCACATATGAAGAA 
GTAGTCAAACTGCCAGCATTCAAAAGGAAAACATTAGTCTTATTAGGTGCACATGGTG 
TTGGAAGAAGACACATAAAAAATACCCTCATCACAAAGCAC (SEQ ID NO. 157)

FIGURE 12EV

TCGGTCATAGTAGTAAGGGAAATCTCCCAGGTAAGATGAATACTGCGGTAGGACGAA 
CAATCCTCCAGGATGTTTGTTCCATATTAAACTGTTACGTGATATGTGCTTGAATATTC 
TGTCCTGAATAATCTCTAGTGTAGTTAATACAATCTTCTCAACTGAAGAAAAATAAGC 
Cl'CCCACAAGAACTGTGTCTGCTGTCTAAGTGCTAGGATTTTATCCTGATGAATAGACC 
TGATTGTAGAAGGAATCTG'IAATAGCAATCTCTCATCGCCTATGACCGAAAGCCGAAT 
TCTGCAGATATCCATCACACTCiGCCGGCCGCTCGAGCATCGATCTAGAGGG 

(SEQ ID NO. 158) 

FIGURE 12EW 

CTGCTTGATGACAAAGGGTGTAGTCTTCATCTTTTCCTGGATTATTTTGGAAGTGACAC. 
GTGGAAATTCCATCGTCACGTTTATGTGGTCTGTAAAGCCAACGATCTCAAATTCTGG 
CGGCTCAAGACiGAGCC.TTTGCAGGCACGATGTAGTCTGAGCAGCGGCACACGGTCAA 
GTCCCCTCTGTGCACTATGACGATGGCGACGACCiTAGCTCTCCATGCCCTCCAACCAC 
TTATCTGTCACGTCACATGATGACTTCGTGGTATCTGAACAGTTCTTAACCTTCGTCAG 
ATTTTCGTCTTT (SEQ ID NO. 159) 

FIGURE 12EX

AAATCGTTGCTTCAGAAAGACTCAATAACACTTACTTGTGCCTGGCTGTGCTC3ACAGT 
ACATTCTGTGTCATTTTCCTTCATGGGCGGAACAGTCCACAGAGCTCACCAACAAGTA 
CTCCAAAACTGAGCAAGAGTTTAAGCTTCGAGATGCAACCAGATGAGCTTCTAGAAAA 
GCCCATGTCTCCCATGCAGTACGCACGGTCTGGACTAGGGACAGCAGAGATGAATGGC 
AAACTCATAGCTGCAGGTGGTTATAACAGAGAGGAATGTCTTCGAACAGTTGAATGCT 
ATGATCCACATACAGATCACTGGTCCTTCCTTGCTCCCATGAGAACATCAAGCAG 

(SEQ ID NO. 160) 

FIGURE 12EY

CTTTCCGAAGAGCACACCCTCCTCTCAATGAGCTTGTGAGGTCTCTTTCTTCTCTTCCT 
TCCAACGTGGTGCTAGCTCCAGGCGAGCGACGTGAGAGTGCCACCTGAGACAGACAC 
C TT G G T C T C AG TT AG AAG G AA G ATG C A G GTC T AA G A G G AAT C C C C G C A G G TCT G TCI G 
AGCTGTGATCAACiAATATTCCGCAATCnXiCCTTTTCTGAGATCGTGTTAGCTCCAAAC; 
CTTTTTCCn'ATCGCAGAGTGTTCAGTTTGTGTTTGTTTCiTTTTTGTTTTGTTTTGTTTlTC
CCTTGGCGGATTTCCCGTGTGT (SEQ ID NO. 161)

FIGURE 12EZ 

CCTATTGAACGGTCTTGCAATGACGAGCATTCAGATGCTTAAGGAAAGCATTGCTGCT 
ACAAATATTTCTATTTTTAGAAAGGGTTTTTATGCiACC'AATGCCCCAGTTGTCAGTCAA 
AGCCGTTGGTGTTTTCATTGTTTAAAATGTCACCTATAAAACGGGCATTAmATGTTT 
TTTTTCCCTTTGTTCATATTCTTTTGCATTCCTGATTATTGTATGTATCGTGTAAAGGAA 
GTCTGTA (SEQ ID NO. 162) 

FIGURE 12FA

CCTATTGAACGGTCTTGCAATGACGAGCATirAGATGCTTAAGGAAAGCATTGCnGCT 
ACAAATATTTCTATTTTTAGAAAGGGTTTTTATGGACCAATGCCCCAGTTGTCAGTCAA 
AGCCGTTCKITGTTTTCATTGTTTAAAATGTCACCTATAAAACGGGCATTATTTATGTTT 
TTTTTCCCTTTGTTCATATTCTTTTGCAT'rCCTGATTATTGTATGTATCGTGTAAAGGAA 
GTCTGTA (SEQ ID NO. 163) 

FIGURE 12FB

CCTGGGTCCGTCCTCCAACCCCTCACGCCCAAACCCTCCGACTTTCACTTCTTGAAGTG 
ATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTGGCTAGGCACAAGGCAGAAGAAGTA 
TTCTATGCAGTCAAAGTTTTACAGAAGAAGCCATCCTGAAGAAGAAAGGAAGGAAGC 
ATATTATGTCAGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTGGG 
CCTTCACTTCTCATTCCAGACCGCTGACAAGCTCT (SEQ ID NO. 164)

FIGURE 12FC 

GATGCTGAACACAAAAAGAAAGAAGAAAAGGAAGAGCrAGGAGCAAGAGAAGCTGAA 
GGGAGGGAGCCTTGGCGAAAATCAGATCAAAGATGAGAAGATTAAAAAGGACAAAG 
AGCCCAAAGAAGAGTCAAGAGCTTCTTGGATAGAAAGAAAGGATTTACAGAGTGAGG 
CGCAGAATGGAGATTCATGACCCACAAACTTAAAC 

(SEQ ID NO. 165) 

FIGURE 12FD

AAAGCCAATTGGTAGAGAAATTGAAGACACAAATGCTGGATCAGGAAGAGCTTCTGCJ 
CATCAACCAGAAGGGATCAAGATAATATGCAAGCTGAACTGAATCGCCTCCAAGCAG 
AAAATGATGCTTCTAAAGAAGAGTAAAGAGTTTTACAGGCCTTAGAGGACTGCTGT'l'A 

ATTATGATCAGAGTTCAGGAGTTAAGAC 

(SEQ ID NO. 166) 

FIGURE 12FE 

CTGCTTGATGTCCTGTGTAGCGAATGTCACAGCGTACAACATTGTTAGTGTAGTCTGAT 
TCAGGCACCAGGTAGCTGGGGTTTACACTGACCTTTAGAATGTAGTTTCCAGGTTGTA 
CATCTGTAATATCAATCCACTGGCAGTCTATGTCTGCCGCATAGGTGTCATAACATCCA 
GGACTCAATCCCTGTGTGTGTGCAGTGCACGCAAAGGCCCTGTGGTACCCATAGTCAC 
AGGACGTGTCCTCCAGACAGAAGCTTGCTTTGTGGCCTTCAGCCACTCTCCTCTGTGTG 
TTGGCATCAACGAGAAGCCGAATTCTCGAGATATCCATCACACT (SEQ ID NO. 167) 

FIGURE 12FF 

CTGCTTGATGTCCTGTGTAGCGAATGTCACAGCGTACAACATTGTTAGTGTAGTCTGAT 
TCAGGCACCAGGTAGCTGGGGTTTACACTGACCTTTAGAATGTAGTTTCCAGGTTGTA 
CATCTGTAATATCAATCCACTGGCAGTCTATGTCTGCCGCATAGGTGTCATAACATCCA 
GGACTCAATCCCTGTGTGTGTGCAGTGCACGCAAAGGCCCTGTGGTACCCATAGTCAC 
AGGACGTGTCCTCCAGACAGAAGCTTGCTTTGTGGCCTTCAGCCACTCTCCTCTGTGTG 
TTGGCATCAACGAGAAGCCGAATTCTCGAGATATCCATCACACT (SEQ ID NO. 168) 

FIGURE 12FG 

GATCTGACACl'ACAGCATGAGCGTTAGATTTCATAAAATTATTTTTCTTCTAAATGCTG 
GAAACTCTAAGGGTTTATTCAGAAAAAAAACTGGCCAATTTTCAAATGGCTTAGAAGC 
AGGGTTAATTAAGTATTGAATGAGCCACTGTGATATCCTGATGACACCCAGTCACAAT 
GACAGTTTTGAAGCATACAACCAAAACAATTGAGATCTCAAAACTATTITACATCACT 
TATGGTAATGTTATGTAAAAATCJAAAATGCTTTCTGTGGAAGTTACATTCTTTACCAGG 
TCTTTAACATAAATTAACACGACGTCGAGTAAGCCTTTGTTCGGAAGACAAACTAGTT 
TGTGAGTTCAGTCAGATCCCAGCT (SEQ ID NO. 169)

FIGURE 12FH

AGTTGCCAGGACCACCACCATAGTTGCCAGGTTCATCATAAACAAATCCAACATCAAT 
CTTAAATTCCCCCATCAGACAATCTGCCCTCAAAGAATGGGAATTATAAACCCGGATA 
CTGATGATCTCATCCATGAGCTCAGAGGGTGTGATGTGCACATTGTAGAAAAATAACT 
CGTCAAAAAACGGATTGTTCCCTCTCTTGATTCTCGTGCGATGCGTCTGACCACAGATG 
TGAACTTTCACCACGGGCCTTATGTTGTTGCCGCATAACTC1ACGGCCCTCGATCACTCT 
GAC ACGGATCTGGAAATCTGTGGCTTGTTGG AC AGC ATCCTT (SEQ ID NO. 170)

FIGURE 12FI 

AAGCCGTGTCCCAAAGAATGGATAGAGACGCGATCAGATGCGACAGTGCTGTGGAGA 
AAGCCCAGGAACCI'GCACAATTGCCCTGGTCCAATGGCTCGTGGATCAGGTTGGGCCA 
CTTCTCTGAAGCTTCAAAGGCAGTGGGTAGCACT'rCCCCTTGGCCCAGCACCGTATAA 
ATCTCATTCATATTCATGACAGTGGAGGATGGGCGGATTGTGCCCAGGCGGTACGGAA 
TGCCCTCATCCAGGGTCATGCCCCAGAAGGCACTGTGGTTCCCAGCCTGCCACCCGTA 
GTTGCCTCGGTTGATGGCTTTAATCATGTCTGGTCACTAGACACGGCTTAAGCGAATCT 
CG AG AT ATCC ATC AC ACTGGCGGCGTCG AGAT (SEQ ID NO. 171) 

FIGURE 12FJ 

AAGCCGTGTCTGATGATGGAGGTAGTGGTGGGGGAGGAGGGACTGAGGGTCCTGAGG 
TGGTGGCCCCTGGAACTGATCCCACATAGTTACCCACTGCTAGTTCTGACCCCGTGGA 
CAACGTGCCAGAGGCCATGACTGGCAGTATGGCAATGTCCCCATCCCCTTTCTTCTTA 
ATTTTAATGGTCCCTTGTTTCTCCAGTTCGTGAATCTTTTTTTCCAGGGTAGACTGTCTT 
TGAATGGCTTCTTCCTTTTCTTTGACCATTTTTCTTAACGTGTGAACTTGGGTATTTGCA 
TCTTTGTAGATTTCCGGACAACATCAGTTCCTTATTCCTCTGCATAAGTTGCTTTCAGTT 

(SEQ ID NO. 172) 

FIGURE 12FK

CGAGTCAGACACATGAAAGCAAAACGCGGGCAGATAAAACGATCGCCTTACCTTCTA 
GCAAAAATCTGAAGCTTGTGTCAGAAACAAAGACTCAGAAAGGTTTGTTTTCAGATGA 
AGAAGACTCTGAGGATTTGTTTTCTTCTCAAAGT'rCAAGTAAGCCAAAAAGTGCATCA 
CTTTCATCCAGCCAGCCCCCAACATCAGTCTCCCTTTTTGGTGATGAAGATGAAGAGG
ACAGTCTTTTTGGGAGTGCAGCAGCTAAGAAGCAGACTTCATCTCTACAACCTCAGAG 
TCAAGAGAAAGCAAAGCCTTCCGAGCAGCCCTCAAAGAAGACATCTGCCTTGTTGTTC 

AGA 

(SEQ ID NO. 173) 

FIGURE 12FL 

CGAGTCAGACTTAATTTAAAAACGAAACAAAACAAAAATAACATAGTTTAGAAATCA 
AGGAGAAAGGACAGATAGTCTAAGAAAAAAGACAACACAAAAGAGGGGCAGGGCGG 
CCAGCTTCiCATCAGGGATCTTGGCTCfGACjACCTGCTTTGAATAGGTTTCTTGCAGGTAT 
Ml'CTTAAATGCTGTGGGGTTTlTCCAGAGTTCCGCAGCCVrGTGTGTTCAAAGGGCTATC 
GATGTTGGGTTCTCCTAGCAGGCTCTGGATAGAGAGCAAGATAGTCCTGACATCATAT 
AGTGCAGACCACTTATCCTTGAGGATGTCCGGCAGATC.TTGCCTGGGTGTCACGTTGG 
GGTGGTAGCAGGGTGTGAGGAACTTCACTG (SEQ ID NO. 174)

FIGURE 12FM 

CGAGTCAGACACTCCTGGCTCCTGGATTCTTTAGATGCCTCCATCAGACTGGGTACTTT 
AGATGCCTCCATCAGACTACTTCGTCATTGTATTTCTCAGTTCGCTCAGGGCAAGCGGC 
AGTCTCTGGGCTGCTGTGGCAGGTGCCACCACTGCATTTAAAAGTTAAAATTTCTTCA 
AATATTCCCATCAAGGCCTTGTAGCCTCTGAGATTGGTTTACTATTTGCCCAGTTATTT 
AAAGCTCTCTGCATTCCTTCCTGATTTAATATTGCTATGGCCAGGACAATGTGTAGAAG 
TAAAAAGGATATCATATTTACAGGTGTAACGC 

(SEQ ID NO. 175)